Abstract. We consider the set of all graphs on n labeled vertices with prescribed
degrees $D = (d_1, \ldots, d_n)$. For a wide class of tame degree sequences D we obtain
a computationally efficient asymptotic formula approximating the number of graphs
within a relative error which approaches 0 as n grows. As a corollary, we prove
that the structure of a random graph with a given tame degree sequence D is well
described by a certain maximum entropy matrix computed from D. We also establish
an asymptotic formula for the number of bipartite graphs with prescribed degrees
of vertices, or, equivalently, for the number of 0-1 matrices with prescribed row and
column sums.



1. Introduction and main results

(1.1) Graphs and their degree sequences. Let $D = (d_1, \ldots, d_n)$ be a vector
of positive integers and let G(D) be the set of all graphs (undirected, with no loops
or multiple edges) on the set {1, . . . , n} of vertices such that the degree of the k-th
vertex is $d_k$ for k = 1, . . . , n. Equivalently, G(D) is the set of all n x n symmetric
matrices with 0-1 entries, zero trace and row (column) sums $d_1, \ldots, d_n$. We assume
that

(1.1.1) $d_1 + \ldots + d_n \equiv 0 \mod 2$,

since otherwise the set G(D) is empty. 

The theorem of Erdős and Gallai, see, for example, Theorem 6.3.6 of [BR91],
states the necessary and sufficient conditions for the existence of a graph with the
given degree sequence. Without loss of generality, we assume that

$d_1 \geq d_2 \geq \ldots \geq d_n$.

Then, the necessary and sufficient condition for G(D) to be non-empty is that
(1.1.1) holds and

(1.1.2) $\sum_{i=1}^{k} d_i \leq k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\}$ for k = 1, . . . , n.
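The conditions (1.1.1) and (1.1.2) are easy to test directly. The following is a minimal sketch in Python; the function name `erdos_gallai` is ours and is not part of the paper.

```python
def erdos_gallai(degrees):
    """Return True if some simple graph has this degree sequence.

    Checks the parity condition (1.1.1) and the inequalities (1.1.2).
    """
    d = sorted(degrees, reverse=True)  # d_1 >= d_2 >= ... >= d_n
    n = len(d)
    if any(x < 0 for x in d) or sum(d) % 2 != 0:
        return False
    for k in range(1, n + 1):
        lhs = sum(d[:k])
        rhs = k * (k - 1) + sum(min(k, x) for x in d[k:])
        if lhs > rhs:
            return False
    return True


print(erdos_gallai([3, 3, 2, 2, 2]))  # a graphic sequence
print(erdos_gallai([3, 3, 3, 1]))     # violates (1.1.2) at k = 2
```

The check costs O(n^2) operations as written, which is sufficient for illustration.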

Our main goal is to estimate the cardinality |G(D)| of G(D). Using the obtained
estimate, we deduce a concentration result for a random graph $G \in G(D)$ sampled
from the uniform probability measure on G(D).

(1.2) The maximum entropy matrix and tame degree sequences. The
following matrix plays the crucial role in our construction.

Let us consider the space $\mathbb{R}^{\binom{n}{2}}$ of vectors $x = (\xi_{\{j,k\}})$, where {j, k} is an
unordered pair of indices $1 \leq j \neq k \leq n$. We consider the polytope $V \subset \mathbb{R}^{\binom{n}{2}}$,
V = V(D), defined by the equations

$\sum_{j:\ j \neq k} \xi_{\{j,k\}} = d_k$ for k = 1, . . . , n

and inequalities

$0 \leq \xi_{\{j,k\}} \leq 1$.

The integer points in V(D) correspond to the labeled graphs with degree sequence
D, which we write as

$G(D) = V(D) \cap \mathbb{Z}^{\binom{n}{2}}$.

We assume that V(D) is non-empty. We consider the following entropy function
on V(D):

$H(x) = \sum_{\{j,k\}} \left( \xi_{\{j,k\}} \ln \frac{1}{\xi_{\{j,k\}}} + \left(1 - \xi_{\{j,k\}}\right) \ln \frac{1}{1 - \xi_{\{j,k\}}} \right)$

for $x = (\xi_{\{j,k\}})$.

Since H is a strictly concave function, it attains its maximum on V(D) at a unique
point, $z = (\zeta_{\{j,k\}})$, z = z(D), which we call the maximum entropy matrix associated
with the degree sequence D. Matrix z can be easily calculated by interior point
methods, see [NN94].
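For small examples the maximum entropy matrix can also be approximated without an interior point solver. The sketch below is not the method of [NN94]. It assumes that V(D) has a non-empty interior, so that by the Lagrange multiplier conditions the maximizer has the form $\zeta_{\{j,k\}} = 1/(1 + e^{-(\lambda_j + \lambda_k)})$, and it solves the n row-sum equations one coordinate at a time by bisection. The function name and the parameter `sweeps` are ours.

```python
import math


def max_entropy_matrix(degrees, sweeps=200):
    """Approximate z = (zeta_{j,k}) for a degree sequence with 0 < d_k < n - 1.

    Assumes zeta_{j,k} = 1 / (1 + exp(-(lam_j + lam_k))) and solves the
    row-sum equations one coordinate at a time by bisection.
    """
    n = len(degrees)
    lam = [0.0] * n

    def row_sum(k, value):
        # Sum of zeta_{j,k} over j != k when lam_k is set to `value`.
        return sum(1.0 / (1.0 + math.exp(-(lam[j] + value)))
                   for j in range(n) if j != k)

    for _ in range(sweeps):
        for k in range(n):
            lo, hi = -40.0, 40.0
            for _ in range(60):  # bisection: row_sum is increasing in lam_k
                mid = 0.5 * (lo + hi)
                if row_sum(k, mid) < degrees[k]:
                    lo = mid
                else:
                    hi = mid
            lam[k] = 0.5 * (lo + hi)

    return [[0.0 if j == k else 1.0 / (1.0 + math.exp(-(lam[j] + lam[k])))
             for k in range(n)] for j in range(n)]


z = max_entropy_matrix([3, 3, 2, 2, 2])
print([round(sum(row), 6) for row in z])  # row sums should reproduce the degrees
```

The coordinate-wise update is chosen here only for brevity; for large n an interior point or Newton-type method, as the text suggests, is the more appropriate tool.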

For $0 < \delta < 1/2$ we say that the degree sequence D is $\delta$-tame if the polytope
V(D) is non-empty and if

$\delta \leq \zeta_{\{j,k\}} \leq 1 - \delta$ for all $1 \leq j \neq k \leq n$,

where $z = (\zeta_{\{j,k\}})$ is the maximum entropy matrix associated with degree sequence
D. In Theorem 2.1 we state some sufficient conditions for a degree sequence D to
be tame.
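(1.3) Quadratic form q and related quantities. Let $z = (\zeta_{\{j,k\}})$ be the
maximum entropy matrix associated with a tame degree sequence D. We consider the
following quadratic form $q : \mathbb{R}^n \to \mathbb{R}$,

(1.3.1) $q(t) = \frac{1}{2} \sum_{\{j,k\}} \left( \zeta_{\{j,k\}} - \zeta_{\{j,k\}}^2 \right) (\tau_j + \tau_k)^2$ for $t = (\tau_1, \ldots, \tau_n)$.

It is easy to see that q is positive definite for n > 2. Let us consider the Gaussian
probability measure on $\mathbb{R}^n$ with density proportional to $e^{-q}$. We define the following
random variables $f, h : \mathbb{R}^n \to \mathbb{R}$,

$f(t) = \frac{1}{6} \sum_{\{j,k\}} \zeta_{\{j,k\}} \left(1 - \zeta_{\{j,k\}}\right) \left(2\zeta_{\{j,k\}} - 1\right) (\tau_j + \tau_k)^3$ and

(1.3.2) $h(t) = \frac{1}{24} \sum_{\{j,k\}} \zeta_{\{j,k\}} \left(1 - \zeta_{\{j,k\}}\right) \left(6\zeta_{\{j,k\}}^2 - 6\zeta_{\{j,k\}} + 1\right) (\tau_j + \tau_k)^4$

for $t = (\tau_1, \ldots, \tau_n)$.

Let

$\mu = \mathbf{E} f^2$ and $\nu = \mathbf{E} h$.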

Our main result is as follows. 

(1.4) Theorem. Let us fix $0 < \delta < 1/2$. Let $D = (d_1, \ldots, d_n)$ be a $\delta$-tame degree
sequence such that $d_1 + \ldots + d_n \equiv 0 \mod 2$, let $z = (\zeta_{\{j,k\}})$ be the maximum
entropy matrix as defined in Section 1.2 and let the quadratic form q and values
of $\mu$ and $\nu$ be as defined in Section 1.3. Let us define an n x n symmetric matrix
$Q = (\omega_{jk})$ by

$\omega_{jk} = \zeta_{\{j,k\}} \left(1 - \zeta_{\{j,k\}}\right)$ for $j \neq k$ and

$\omega_{jj} = d_j - \sum_{k:\ k \neq j} \zeta_{\{j,k\}}^2$ for j = 1, . . . , n.

Then Q is positive definite and the value of

(1.4.1) $\frac{2 e^{H(z)}}{(2\pi)^{n/2} \sqrt{\det Q}} \exp\left\{ -\frac{\mu}{2} + \nu \right\}$

approximates the number of graphs |G(D)| with degree sequence D within a relative
error which approaches 0 as $n \to +\infty$.

More precisely, for any $0 < \epsilon \leq 1/2$ the value of (1.4.1) approximates |G(D)|
within relative error $\epsilon$ provided

$n \geq \left( \frac{1}{\epsilon} \right)^{\gamma(\delta)}$,
where $\gamma = \gamma(\delta)$ is a positive constant.
The main term

(1.4.2) $\frac{2 e^{H(z)}}{(2\pi)^{n/2} \sqrt{\det Q}}$

of formula (1.4.1) is the "Gaussian approximation" formula of [BH10], whose
appearance, as is discussed in [BH10], is explained by the Local Central Limit
Theorem, see also the discussion below. The factor

$\exp\left\{ -\frac{\mu}{2} + \nu \right\}$

is the "Edgeworth correction" factor, see [BH09b]. In the course of the proof of
Theorem 1.4, we establish a two-sided bound

$\gamma_1(\delta) \leq \exp\left\{ -\frac{\mu}{2} + \nu \right\} \leq \gamma_2(\delta)$

for some constants $\gamma_1(\delta), \gamma_2(\delta) > 0$, as long as the degree sequence D remains
$\delta$-tame.

We note that computing the expectation of a polynomial with respect to the
Gaussian probability measure is a linear algebra problem, cf. also Section 5.2.
Hence apart from computing the maximum entropy matrix z, which can be done
by interior point methods, computing the value of (1.4.1) is a linear algebra problem
which can be solved in $O(n^4)$ time in the unit cost model.

(1.5) Random graphs with prescribed degree sequences. Let us consider
the set G(D) of all labeled graphs with degree sequence D as a finite probability
space with the uniform measure. It is convenient to think of $G \in G(D)$ as a
subgraph of the complete graph $K_n$ with the set

V = {1, . . . , n}

of vertices and the set

$E = \{ \{j,k\} :\ 1 \leq j \neq k \leq n \}$

of edges.

Let us sample a graph $G \in G(D)$ at random. What is G likely to look like?

As a corollary of Theorem 1.4, we prove that with overwhelming probability, for
a random graph $G \in G(D)$ the number of edges of G in a given set $S \subset E$ with
$|S| = \Omega(n^2)$ is very close to the sum of the entries of the maximum entropy matrix
indexed by the elements of S.

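(1.6) Theorem. Let us fix numbers $\kappa > 0$ and $0 < \delta < 1/2$. Then there exists a
number $\gamma(\kappa, \delta) > 0$ such that the following holds.

Suppose that $n \geq \gamma(\kappa, \delta)$ and that $D = (d_1, \ldots, d_n)$ is a $\delta$-tame degree sequence
such that $d_1 + \ldots + d_n \equiv 0 \mod 2$. For a set $S \subset E$, let $\sigma_S(G)$ be the number of
edges of graph $G \in G(D)$ that belong to set S and let

$\sigma_S(z) = \sum_{\{j,k\} \in S} \zeta_{\{j,k\}}$,

where $z = (\zeta_{\{j,k\}})$ is the maximum entropy matrix. Suppose that $|S| \geq \delta n^2$ and
let

$\epsilon = \delta \frac{\ln n}{\sqrt{n}}$.

If $\epsilon \leq 1$ then for a uniformly chosen random graph $G \in G(D)$, we have

$\mathbf{P}\left\{ G \in G(D) :\ (1 - \epsilon)\sigma_S(z) \leq \sigma_S(G) \leq (1 + \epsilon)\sigma_S(z) \right\} \geq 1 - 2n^{-\kappa n}$.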

The idea of the proof is as follows. For $1 \leq j \neq k \leq n$, let $x_{\{j,k\}}$ be independent
Bernoulli random variables such that

$\mathbf{P}\{x_{\{j,k\}} = 1\} = \zeta_{\{j,k\}}$ and $\mathbf{P}\{x_{\{j,k\}} = 0\} = 1 - \zeta_{\{j,k\}}$.

As is shown in [BH10], the probability mass function of the random vector
$X = (x_{\{j,k\}})$ is constant on the integer points of V(D) and is equal to $e^{-H(z)}$ at each
$G \in G(D)$, so that the vector X conditioned on G(D) is uniform. Theorem 1.4
then implies that the probability that $X \in G(D)$ is not too small. On the other
hand, standard large deviation inequalities imply that the sum $\sum_{\{j,k\} \in S} x_{\{j,k\}}$
concentrates about the value of $\sigma_S(z) = \sum_{\{j,k\} \in S} \zeta_{\{j,k\}}$. We supply the details of the
proof in Section 10.

In many respects random graphs $G \in G(D)$ behave like random graphs on the
set {1, . . . , n} of vertices, with pairs {j, k} chosen as the edges of G independently
with probabilities $\zeta_{\{j,k\}}$, where $z = (\zeta_{\{j,k\}})$ is the maximum entropy matrix. As
is discussed in [BH10], the distribution of the multivariate Bernoulli random
vector $X = (x_{\{j,k\}})$ is the distribution of the largest entropy among all multivariate
Bernoulli random vectors constrained by

$\mathbf{E} y_k = d_k$ for k = 1, . . . , n

where

$y_k = \sum_{j:\ j \neq k} x_{\{j,k\}}$.

We remark that we obtain the "Gaussian approximation" term (1.4.2) if we
assume that the vector of random variables $Y = (y_1, \ldots, y_n)$ is asymptotically
Gaussian around its expectation $(d_1, \ldots, d_n)$. As it turns out, Y is not exactly
Gaussian but is not very far from it.

It looks plausible that both Theorem 1.4 and Theorem 1.6 can be extended to
degree sequences D allowing a moderate number of entries $\zeta_{\{j,k\}}$ of the maximum
entropy matrix to be arbitrarily close to 1 or 0. Our proofs, however, do not seem
to allow such an extension, with Theorem 4.1 being the main obstacle. Some of
our proofs (mostly in Sections 5 and 6) are similar to those of [BH09a], where we
applied the maximum entropy approach of [BH10] to count non-negative integer
matrices with prescribed row and column sums.

The paper is organized as follows. 

In Section 2, we give several examples and extensions concerning our main result,
Theorem 1.4, and also discuss related work in the literature.

In Section 3, we present an integral representation for the number |G(D)| of
graphs and also describe the plan of the proof of Theorem 1.4.

The rest of the paper deals with the proofs. 



2. Examples and extensions 

Sometimes one can tell that a degree sequence is tame without computing the 
maximum entropy matrix. 

(2.1) Theorem. Let us fix real numbers $0 < \alpha < \beta < 1$ such that

$\beta < 2\sqrt{\alpha} - \alpha$, or, equivalently, $(\alpha + \beta)^2 < 4\alpha$.

Then there exists a real number $\delta = \delta(\alpha, \beta) > 0$ and a positive integer $n_0 = n_0(\alpha, \beta)$
such that any degree sequence $D = (d_1, \ldots, d_n)$ satisfying

$\alpha < \frac{d_i}{n-1} < \beta$ for i = 1, . . . , n

is $\delta$-tame provided $n > n_0$.
One can choose 



f 4(/3-a) 1 , 

n0 = maX (a^)' 4a-(a + ^ j + 1 ^ 

o — 7T where e = mm i a, a — 

1 + e 6 



For example, degree sequences $D = (d_1, \ldots, d_n)$ satisfying

$0.25 < \frac{d_i}{n-1} < 0.74$ for i = 1, . . . , n

or

$0.01 < \frac{d_i}{n-1} < 0.18$ for i = 1, . . . , n

or

$0.81 < \frac{d_i}{n-1} < 0.89$ for i = 1, . . . , n

are $\delta$-tame for some $\delta > 0$ and all sufficiently large n.
We prove Theorem 2.1 in Section 12.
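The hypothesis of Theorem 2.1 on the pair $(\alpha, \beta)$ is a one-line test. The sketch below, with a function name of our own choosing, checks it for the three examples above; it says nothing about the values of $\delta$ and $n_0$.

```python
import math


def satisfies_theorem_2_1(alpha, beta):
    """Check 0 < alpha < beta < 1 and beta < 2*sqrt(alpha) - alpha."""
    return 0 < alpha < beta < 1 and beta < 2 * math.sqrt(alpha) - alpha


for alpha, beta in [(0.25, 0.74), (0.01, 0.18), (0.81, 0.89)]:
    print(alpha, beta, satisfies_theorem_2_1(alpha, beta))
```

In each of the three cases the upper bound is just below the threshold $2\sqrt{\alpha} - \alpha$, which equals 0.75, 0.19 and 0.99 respectively.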

(2.2) On the boundary of $\delta$-tameness. Let us choose rational $0 < \alpha < \beta < 1$
such that

(2.2.1) $\beta = 2\sqrt{\alpha} - \alpha$.

Clearly, $\beta > \alpha$. Let us choose a positive integer n such that $\alpha n$ and $\beta n$ are even
integers and let us consider the degree sequence

$d_1 = \ldots = d_k = \beta n$ and $d_{k+1} = \ldots = d_n = \alpha n$ for $k = n\sqrt{\alpha}$

(note that k is necessarily integral). The Erdős-Gallai condition (1.1.2) for
$k = n\sqrt{\alpha}$ reduces to

(2.2.2) $\beta \leq 2\sqrt{\alpha} - \alpha - \frac{1}{n}$.

In particular, (2.2.1) does not even guarantee that the polytope V(D) is non-empty.
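A concrete instance may help. The values $\alpha = 1/4$, $\beta = 3/4$ and n = 8 below are our own choice (they satisfy $\beta = 2\sqrt{\alpha} - \alpha$, and $\alpha n$, $\beta n$ are even); the sketch evaluates both sides of the Erdős-Gallai inequality at $k = n\sqrt{\alpha} = 4$.

```python
from fractions import Fraction

alpha, beta, n = Fraction(1, 4), Fraction(3, 4), 8   # beta = 2*sqrt(alpha) - alpha
sqrt_alpha = Fraction(1, 2)
k = int(n * sqrt_alpha)                               # k = n*sqrt(alpha) = 4
degrees = [int(beta * n)] * k + [int(alpha * n)] * (n - k)

# Left and right sides of the Erdos-Gallai inequality for this k.
lhs = sum(degrees[:k])
rhs = k * (k - 1) + sum(min(k, d) for d in degrees[k:])
print(degrees, lhs, rhs, lhs <= rhs)

# The same conclusion from the reduced condition beta <= 2*sqrt(alpha) - alpha - 1/n.
reduced_ok = beta <= 2 * sqrt_alpha - alpha - Fraction(1, n)
print(reduced_ok)
```

Here the left side is 24 and the right side is 20, so the inequality fails and no graph with this degree sequence exists.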

In [JSM92] Jerrum, Sinclair and McKay discuss under what conditions an
approximation formula for |G(D)| which depends "smoothly" on D may exist. They
describe the phenomenon of the number of graphs |G(D)| changing sharply when
the degree sequence D is varying only slightly around some special values of D. This
phenomenon is apparently explained by the fact that the dimension of the polytope
V(D) may change abruptly or the polytope may disappear altogether when D lies
on the boundary of the Erdős-Gallai conditions (1.1.2). Theorem 8.1 of [JSM92]
states that for

$d_+ = \max\{d_i,\ i = 1, \ldots, n\}$ and $d_- = \min\{d_i,\ i = 1, \ldots, n\}$,

as long as

(2.2.3) $(d_+ - d_- + 1)^2 \leq 4 d_- (n - d_+ - 1)$,

the degree sequence D is P-stable, meaning that increasing one of the degrees $d_i$
and decreasing another by 1 does not change |G(D)| by more than a factor of $n^{10}$
(this, in turn, implies that there are polynomial time randomized approximation
algorithms for computing |G(D)| and sampling a random graph $G \in G(D)$). The
condition of our Theorem 2.1 is only marginally stronger than (2.2.3).

As Sourav Chatterjee pointed out to us, Lemma 4.1 of the recent paper [CDS11] shows
that a sequence D is $\delta$-tame provided it lies sufficiently deep inside the polyhedron
defined by the Erdős-Gallai conditions (1.1.2).

Our example shows that the bounds of Theorem 2.1 are essentially the best 
possible if we take into account only the largest and the smallest degree of a vertex 
of the graph. 
(2.3) Regular graphs. In [MW90] McKay and Wormald compute the asymptotics
of |G(D)| for regular graphs, where

$d_1 = \ldots = d_n = d$,

and almost regular graphs, where

$|d_i - d| < n^{1/2 + \epsilon}$ for i = 1, . . . , n

for a sufficiently small $\epsilon > 0$; see also [McK11] for recent developments and [McK10]
for a survey. One can show that the formula of Theorem 1.4 is equivalent to the
asymptotic formula of [MW90] for regular or almost regular graphs.
In the case of regular graphs, symmetry requires that

$\zeta_{\{j,k\}} = \frac{d}{n-1}$ for all $1 \leq j \neq k \leq n$

for the maximum entropy matrix $z = (\zeta_{\{j,k\}})$.

(2.4) Approximations in the cut norm. The cut norm (sometimes called the
normalized cut norm) of a real m x n matrix $A = (a_{jk})$ is defined by

$\|A\|_{cut} = \frac{1}{mn} \max_{J, K} \left| \sum_{j \in J,\ k \in K} a_{jk} \right|$,

where the maximum is taken over all non-empty subsets $J \subset \{1, \ldots, m\}$ and
$K \subset \{1, \ldots, n\}$. Let us choose set S in Theorem 1.6 of the form

$S = \{ \{j,k\} :\ j \in J,\ k \in K,\ j \neq k \}$ for some $J, K \subset \{1, \ldots, n\}$.

We note that there are not more than $2^{2n}$ distinct sets S of this form. Theorem
1.6 implies that as n grows, the maximum entropy matrix z(D) approximates the
adjacency matrix of the overwhelming majority of graphs $G \in G(D)$ within an error
of $O(n^{-1/2} \ln n)$ in the cut norm.
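For very small matrices the cut norm defined above can be evaluated by exhaustive search over the subsets J and K. The sketch below does exactly that; it takes exponential time and is meant only to make the definition concrete. The function names are ours.

```python
from itertools import combinations


def nonempty_subsets(size):
    """Yield all non-empty subsets of range(size) as tuples."""
    for r in range(1, size + 1):
        yield from combinations(range(size), r)


def cut_norm(a):
    """Normalized cut norm of a small real matrix, by exhaustive search."""
    m, n = len(a), len(a[0])
    best = 0.0
    for rows in nonempty_subsets(m):
        # Column sums restricted to the chosen rows.
        col = [sum(a[j][k] for j in rows) for k in range(n)]
        for cols in nonempty_subsets(n):
            best = max(best, abs(sum(col[k] for k in cols)))
    return best / (m * n)


print(cut_norm([[1, -1], [-1, 1]]))
print(cut_norm([[1, 1], [1, 1]]))
```

To compare a graph with the maximum entropy matrix in this norm, one would apply `cut_norm` to the difference of the adjacency matrix and z.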

Shortly after the first version of this paper appeared, using a different approach, 
Chatterjee, Diaconis and Sly [CDS11] described graph limits of graphs from G(D) 
as n grows. A graph limit is a certain function on [0, 1] x [0, 1], viewed as an "infinite 
matrix" , which naturally arises as a limit object for a Cauchy sequence in the cut 
norm of adjacency matrices of graphs [LS06]. Graph limits constructed in [CDS11] 
can indeed be viewed as "infinite maximum entropy matrices" . 



(2.5) Enumeration of bipartite graphs. A natural version of the problem
concerns enumeration of labeled bipartite graphs with a given degree sequence or,
equivalently, m x n matrices with 0-1 entries and prescribed row sums $R = (r_1, \ldots, r_m)$
and column sums $C = (c_1, \ldots, c_n)$. We assume that

$r_1 + \ldots + r_m = c_1 + \ldots + c_n$.

A simple necessary and sufficient condition for a 0-1 matrix with prescribed row 
and column sums to exist is given by the Gale-Ryser Theorem, see, for example, 
Corollary 6.2.5 of [BR91]. 

Let us consider the polytope V(R, C) of m x n matrices $x = (\xi_{jk})$ defined by the
equations

$\sum_{k=1}^{n} \xi_{jk} = r_j$ for j = 1, . . . , m and $\sum_{j=1}^{m} \xi_{jk} = c_k$ for k = 1, . . . , n

and inequalities

$0 \leq \xi_{jk} \leq 1$ for all j, k.

Let us compute the maximum entropy matrix $z = (\zeta_{jk})$ as the necessarily unique
matrix $z \in V(R, C)$ that maximizes

$H(x) = \sum_{j,k} \left( \xi_{jk} \ln \frac{1}{\xi_{jk}} + (1 - \xi_{jk}) \ln \frac{1}{1 - \xi_{jk}} \right)$ for $x = (\xi_{jk})$

on V(R, C). For $0 < \delta < 1/2$, we say that the margins (R, C) are $\delta$-tame if

$\delta m \leq n$ and $\delta n \leq m$

and

$\delta \leq \zeta_{jk} \leq 1 - \delta$ for all j, k.

Suppose that the margins (R, C) are indeed $\delta$-tame for some $\delta > 0$. Let us define
a quadratic form $q : \mathbb{R}^{m+n} \to \mathbb{R}$ by

(2.5.1) $q(s, t) = \frac{1}{2} \sum_{1 \leq j \leq m,\ 1 \leq k \leq n} \left( \zeta_{jk} - \zeta_{jk}^2 \right) (\sigma_j + \tau_k)^2$

for $(s, t) = (\sigma_1, \ldots, \sigma_m; \tau_1, \ldots, \tau_n)$.

Let

(2.5.2) $u = (\underbrace{1, \ldots, 1}_{m \text{ times}};\ \underbrace{-1, \ldots, -1}_{n \text{ times}})$

and let $L = u^{\perp}$ be the orthogonal complement to u in $\mathbb{R}^{m+n}$. Then the restriction
q|L of q onto L is strictly positive definite and we define det q|L as the product of
the non-zero eigenvalues of q. We consider the Gaussian probability measure on L
with density proportional to $e^{-q}$ and define random variables $f, h : L \to \mathbb{R}$ by

$f(s, t) = \frac{1}{6} \sum_{j,k} \zeta_{jk} (1 - \zeta_{jk}) (2\zeta_{jk} - 1) (\sigma_j + \tau_k)^3$ and

(2.5.3) $h(s, t) = \frac{1}{24} \sum_{j,k} \zeta_{jk} (1 - \zeta_{jk}) \left( 6\zeta_{jk}^2 - 6\zeta_{jk} + 1 \right) (\sigma_j + \tau_k)^4$

for $(s, t) = (\sigma_1, \ldots, \sigma_m; \tau_1, \ldots, \tau_n)$.

We define

$\mu = \mathbf{E} f^2$ and $\nu = \mathbf{E} h$.

Then the number |R, C| of 0-1 matrices with row sums R and column sums C is

(2.5.4) $|R, C| = \frac{e^{H(z)} \sqrt{m+n}}{(4\pi)^{(m+n-1)/2} \sqrt{\det q|L}} \exp\left\{ -\frac{\mu}{2} + \nu \right\} \left( 1 + o(1) \right)$

provided $m, n \to +\infty$ in such a way that the margins (R, C) remain $\delta$-tame for
some $\delta > 0$. We sketch the proof of (2.5.4) in Section 11.

Canfield and McKay [CM05] obtained an asymptotic formula for |R, C| when all
row sums are equal, $r_1 = \ldots = r_m$, and all column sums are equal, $c_1 = \ldots = c_n$,
which was later extended to the case of "almost equal" row sums and "almost
equal" column sums [CGM08], see also [GM09]. The maximum entropy matrix z
was introduced in [Ba10] where a cruder asymptotic formula

$\ln |R, C| \approx H(z)$

was established without the $\delta$-tameness assumption and for a wider class of
enumeration problems, including enumeration of 0-1 matrices with prescribed row and
column sums and zeros in prescribed positions. It was also shown in [Ba10] that a
random 0-1 matrix with prescribed row and column sums concentrates about the
maximum entropy matrix z.

3. An integral representation for the number of graphs 

In [BH10] we proved the following general result; see Theorem 5, Lemma 11 and 
formula (16) there. 

(3.1) Theorem. Let $P \subset \mathbb{R}^p$ be a polyhedron defined by the system of linear
equations Ax = b, where A is an n x p matrix with columns $a_1, \ldots, a_p \in \mathbb{Z}^n$ and
$b \in \mathbb{Z}^n$ is an integer vector, and inequalities $0 \leq x \leq 1$ (the inequalities are understood
coordinate-wise). Suppose that P has a non-empty interior, that is, contains a point
$x = (\xi_1, \ldots, \xi_p)$ such that $0 < \xi_j < 1$ for j = 1, . . . , p.

Then the function

$H(x) = \sum_{j=1}^{p} \left( \xi_j \ln \frac{1}{\xi_j} + (1 - \xi_j) \ln \frac{1}{1 - \xi_j} \right)$ for $x = (\xi_1, \ldots, \xi_p)$

attains its maximum on P at a unique point $z = (\zeta_1, \ldots, \zeta_p)$ such that $0 < \zeta_j < 1$
for j = 1, . . . , p.

Let us consider the parallelepiped $\Pi = [-\pi, \pi]^n$, $\Pi \subset \mathbb{R}^n$. Then the number
$|P \cap \{0,1\}^p|$ of 0-1 points in P can be written as

$|P \cap \{0,1\}^p| = \frac{e^{H(z)}}{(2\pi)^n} \int_{\Pi} e^{-i \langle t, b \rangle} \prod_{j=1}^{p} \left( 1 - \zeta_j + \zeta_j e^{i \langle a_j, t \rangle} \right) dt$,

where $\langle \cdot, \cdot \rangle$ is the standard scalar product in $\mathbb{R}^n$, dt is the standard Lebesgue measure
in $\mathbb{R}^n$ and $i = \sqrt{-1}$.

□

The idea of the proof is as follows. Let $X = (x_1, \ldots, x_p)$ be a random
vector of independent Bernoulli random variables such that $\mathbf{P}\{x_j = 1\} = \zeta_j$ and
$\mathbf{P}\{x_j = 0\} = 1 - \zeta_j$ for j = 1, . . . , p. It turns out that the probability mass
function of X is constant on the set $P \cap \{0,1\}^p$ and equals $e^{-H(z)}$ for every 0-1 point
in P. Letting Y = AX, we obtain

$|P \cap \{0,1\}^p| = e^{H(z)} \mathbf{P}\{X \in P\} = e^{H(z)} \mathbf{P}\{Y = b\}$

and the probability in question is written as the integral of the characteristic
function of Y.
Since

$\sum_{j=1}^{p} \zeta_j a_j = b$,

in a neighborhood of the origin t = 0 the integrand can be written as

(3.2) $e^{-i \langle t, b \rangle} \prod_{j=1}^{p} \left( 1 - \zeta_j + \zeta_j e^{i \langle a_j, t \rangle} \right) = \exp\Big\{ -\frac{1}{2} \sum_{j=1}^{p} \zeta_j (1 - \zeta_j) \langle a_j, t \rangle^2 + \frac{i}{6} \sum_{j=1}^{p} \zeta_j (1 - \zeta_j)(2\zeta_j - 1) \langle a_j, t \rangle^3 + \frac{1}{24} \sum_{j=1}^{p} \zeta_j (1 - \zeta_j) \left( 6\zeta_j^2 - 6\zeta_j + 1 \right) \langle a_j, t \rangle^4 + O\Big( \sum_{j=1}^{p} |\langle a_j, t \rangle|^5 \Big) \Big\}$.

Note that the linear term is absent in the expansion.
We obtain the following corollary.

(3.3) Corollary. Let $D = (d_1, \ldots, d_n)$ be a degree sequence such that the polytope
V(D) defined in Section 1.2 has a non-empty interior and let $z = (\zeta_{\{j,k\}})$ be the
maximum entropy matrix. Let

$F(t) = \exp\Big\{ -i \sum_{m=1}^{n} d_m \tau_m \Big\} \prod_{\{j,k\}} \left( 1 - \zeta_{\{j,k\}} + \zeta_{\{j,k\}} e^{i(\tau_j + \tau_k)} \right)$

for $t = (\tau_1, \ldots, \tau_n)$.
Then for the parallelepiped $\Pi = [-\pi, \pi]^n$, we have

$|G(D)| = \frac{e^{H(z)}}{(2\pi)^n} \int_{\Pi} F(t) \, dt$.

Proof. Follows by Theorem 3.1. □
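The integral representation of Corollary 3.3 can be checked on tiny regular examples. The sketch below is only an illustration under two assumptions of ours: it treats d-regular sequences with 0 < d < n - 1, where $\zeta_{\{j,k\}} = d/(n-1)$ for all pairs (see Section 2.3), and it replaces the integral by an average over a grid of n equally spaced angles per coordinate. The latter is exact here because F is a trigonometric polynomial whose exponents in each $\tau_j$ lie between -d and n - 1 - d. The function name is ours.

```python
import cmath
import math
from itertools import combinations, product


def count_regular_graphs(n, d):
    """Count labeled d-regular graphs on n vertices via the integral formula.

    Uses zeta = d / (n - 1) for every pair (regular case, 0 < d < n - 1) and
    replaces the integral over [-pi, pi]^n by an average over an n-point grid
    per coordinate, which is exact for this trigonometric polynomial.
    """
    zeta = d / (n - 1)
    pairs = list(combinations(range(n), 2))
    # H(z): all entries of z are equal.
    entropy = len(pairs) * (-zeta * math.log(zeta)
                            - (1 - zeta) * math.log(1 - zeta))
    angles = [2 * math.pi * r / n for r in range(n)]
    total = 0j
    for t in product(angles, repeat=n):
        value = cmath.exp(-1j * d * sum(t))
        for j, k in pairs:
            value *= 1 - zeta + zeta * cmath.exp(1j * (t[j] + t[k]))
        total += value
    mean = total / n ** n          # equals (2*pi)^{-n} times the integral of F
    return math.exp(entropy) * mean.real


print(round(count_regular_graphs(4, 2)))  # labeled 4-cycles
print(round(count_regular_graphs(5, 2)))  # labeled 5-cycles
```

The grid has $n^n$ points, so this is feasible only for very small n; the point of the paper is precisely to replace this exact computation by an asymptotic evaluation.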

We note that in the case of regular and almost regular graphs (see Section 2.3) the 
integral of Corollary 3.3 is the same as the one evaluated by McKay and Wormald 
[MW90]. 

(3.4) Plan of the proof of Theorem 1.4. We use the integral representation
of Corollary 3.3. Let us define subsets $U, W \subset \Pi$ by

$U = \left\{ (\tau_1, \ldots, \tau_n) :\ |\tau_j| \leq \frac{\ln n}{\sqrt{n}} \text{ for } j = 1, \ldots, n \right\}$

and

$W = \left\{ (\tau_1, \ldots, \tau_n) :\ |\tau_j - \sigma_j \pi| \leq \frac{\ln n}{\sqrt{n}} \text{ for some } \sigma_j = \pm 1 \text{ and } j = 1, \ldots, n \right\}$.

We show that the integral of F(t) over $\Pi \setminus (U \cup W)$ is asymptotically negligible.
Namely, in Section 8 we prove that the integral

$\int_{\Pi \setminus (U \cup W)} |F(t)| \, dt$

is asymptotically negligible compared to the integral

(3.4.1) $\int_{U} |F(t)| \, dt$.

It is easy to show that

$\int_{U} F(t) \, dt = \int_{W} F(t) \, dt$,

provided $d_1 + \ldots + d_n$ is even.
In Section 7, we evaluate

(3.4.2) $\int_{U} F(t) \, dt$.

In particular, we show that the integrals (3.4.1) and (3.4.2) have the same order of
magnitude and so the integral of F(t) outside of $U \cup W$ is indeed asymptotically
irrelevant.

From (3.2) one can deduce that asymptotically as $n \to +\infty$,

$F(t) \approx \exp\{ -q(t) + i f(t) + h(t) \}$ for $t \in U$,

where q is defined by (1.3.1) and f and h are defined by (1.3.2).

Let us consider the Gaussian probability measure in $\mathbb{R}^n$ with density proportional
to $e^{-q}$. In Section 6, we prove that with respect to that measure

(3.4.3) $h(t) \approx \mathbf{E} h = \nu$ almost everywhere in U.

This allows us to conclude that

$\int_{U} \exp\{ -q(t) + i f(t) + h(t) \} \, dt \approx e^{\nu} \int_{U} \exp\{ -q(t) + i f(t) \} \, dt$.

In Section 5, we prove that asymptotically, as $n \to +\infty$, function f is a Gaussian
random variable, so

(3.4.4) $\int_{U} \exp\{ -q(t) + i f(t) \} \, dt \approx \int_{\mathbb{R}^n} \exp\{ -q(t) + i f(t) \} \, dt \approx \exp\left\{ -\frac{1}{2} \mathbf{E} f^2 \right\} \int_{\mathbb{R}^n} e^{-q(t)} \, dt$,

which concludes the evaluation of (3.4.2).

The crucial consideration used in proving (3.4.3) and (3.4.4) is that with respect
to the Gaussian probability measure in $\mathbb{R}^n$ with density proportional to $e^{-q}$, the
coordinate functions $\tau_1, \ldots, \tau_n$ are weakly correlated, that is,

(3.4.5) $|\mathbf{E} \tau_j \tau_k| = O\left( \frac{1}{n^2} \right)$ for $j \neq k$ and $\mathbf{E} \tau_j^2 = O\left( \frac{1}{n} \right)$ for j = 1, . . . , n.

We prove (3.4.5) in Section 4, where we essentially use the $\delta$-tameness assumption.

(3.5) Notation. By $\gamma$, sometimes with an index or a list of parameters, we denote
a positive constant depending only on the listed parameters. The most common
appearance will be $\gamma(\delta)$, a positive constant depending only on the $\delta$-tameness
constant $\delta$.

As usual, for two functions $g_1$ and $g_2$, where $g_2$ is non-negative, we write
$g_1 = O(g_2)$ if $|g_1| \leq \gamma g_2$ and $g_1 = \Omega(g_2)$ if $g_1 \geq \gamma g_2$ for some $\gamma > 0$.
4. Correlations

Let $z = (\zeta_{\{j,k\}})$ be the maximum entropy matrix as defined in Section 1.2. We
assume that

$0 < \zeta_{\{j,k\}} < 1$ for all $j \neq k$.

We define the quadratic form $q : \mathbb{R}^n \to \mathbb{R}$ by

$q(t) = \frac{1}{2} \sum_{\{j,k\}} \left( \zeta_{\{j,k\}} - \zeta_{\{j,k\}}^2 \right) (\tau_j + \tau_k)^2$ for $t = (\tau_1, \ldots, \tau_n)$.

For n > 2 the quadratic form q is strictly positive definite. We consider the Gaussian
probability measure on $\mathbb{R}^n$ with density proportional to $e^{-q}$. We consider a point
$t = (\tau_1, \ldots, \tau_n)$ as a random vector and $\tau_1, \ldots, \tau_n$ as random variables.
The main result of this section is as follows.

(4.1) Theorem. For any $0 < \delta < 1/2$ there exists $\gamma(\delta) > 0$ such that the following
holds.

Suppose that

$\delta \leq \zeta_{\{j,k\}} \leq 1 - \delta$ for all $j \neq k$.

Then

$|\mathbf{E} \tau_j \tau_k| \leq \frac{\gamma(\delta)}{n^2}$ provided $j \neq k$ and

$\mathbf{E} \tau_j^2 \leq \frac{\gamma(\delta)}{n}$ for j = 1, . . . , n.

We will often consider the following situation. Let $\psi : \mathbb{R}^n \to \mathbb{R}$ be a positive
definite quadratic form. We consider the Gaussian probability measure in $\mathbb{R}^n$ with
density proportional to $e^{-\psi}$. For a polynomial (random variable) $f : \mathbb{R}^n \to \mathbb{R}$ we
denote by $\mathbf{E}(f; \psi)$ its expectation with respect to the measure. For a subspace
$L \subset \mathbb{R}^n$, we consider the restriction $\psi|L$ of $\psi$ onto L and the Gaussian probability
measure on L with density proportional to $e^{-\psi|L}$. For a polynomial $f : \mathbb{R}^n \to \mathbb{R}$,
we denote by $\mathbf{E}(f; \psi|L)$ the expectation of the restriction $f : L \to \mathbb{R}$ with respect
to that Gaussian probability measure on L. We will use the following standard
fact: suppose that $\mathbb{R}^n = L_1 \oplus L_2$ is a decomposition of $\mathbb{R}^n$ into the direct sum of
orthogonal subspaces such that

$\psi(t_1 + t_2) = \psi(t_1) + \psi(t_2)$ for all $t_1 \in L_1$ and $t_2 \in L_2$,

so that the coordinates $t_1 \in L_1$ and $t_2 \in L_2$ of the point $t = t_1 + t_2$, $t \in \mathbb{R}^n$, are
independent. Let $\ell_1, \ell_2 : \mathbb{R}^n \to \mathbb{R}$ be linear functions. Then

$\mathbf{E}(\ell_1 \ell_2; \psi) = \mathbf{E}(\ell_1 \ell_2; \psi|L_1) + \mathbf{E}(\ell_1 \ell_2; \psi|L_2)$.

Indeed, writing $t = t_1 + t_2$ with $t_1 \in L_1$ and $t_2 \in L_2$ and noting that
$\ell_{1,2}(t) = \ell_{1,2}(t_1) + \ell_{1,2}(t_2)$, we obtain

$\mathbf{E}(\ell_1(t) \ell_2(t); \psi) = \mathbf{E}(\ell_1(t_1) \ell_2(t_1); \psi) + \mathbf{E}(\ell_1(t_1) \ell_2(t_2); \psi) + \mathbf{E}(\ell_1(t_2) \ell_2(t_1); \psi) + \mathbf{E}(\ell_1(t_2) \ell_2(t_2); \psi)$
$= \mathbf{E}(\ell_1 \ell_2; \psi|L_1) + 2 \mathbf{E}(\ell_1; \psi) \mathbf{E}(\ell_2; \psi) + \mathbf{E}(\ell_1 \ell_2; \psi|L_2)$
$= \mathbf{E}(\ell_1 \ell_2; \psi|L_1) + \mathbf{E}(\ell_1 \ell_2; \psi|L_2)$.

We deduce Theorem 4.1 from the following result.

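(4.2) Proposition. Let n > 2 and let $\xi_{\{j,k\}} :\ 1 \leq j \neq k \leq n$ be a set of numbers
such that

$\alpha \leq \xi_{\{j,k\}} \leq \beta$ for all j, k

and some $\beta > \alpha > 0$.
Let

$\sigma_k = \sum_{j:\ j \neq k} \xi_{\{j,k\}}$ for k = 1, . . . , n.

Let us consider the quadratic form $\psi : \mathbb{R}^n \to \mathbb{R}$ defined by

$\psi(t) = \frac{1}{2} \sum_{\{j,k\}} \xi_{\{j,k\}} \left( \frac{\tau_j}{\sqrt{\sigma_j}} + \frac{\tau_k}{\sqrt{\sigma_k}} \right)^2$ for $t = (\tau_1, \ldots, \tau_n)$,

where the sum is taken over all unordered pairs of indices $1 \leq j \neq k \leq n$. Then $\psi$ is
a positive definite quadratic form and we consider the Gaussian probability measure
in $\mathbb{R}^n$ with density proportional to $e^{-\psi}$.
Let

$\epsilon = \frac{\alpha}{\beta}$.

Then for $n > 2/\epsilon$ we have

$|\mathbf{E} \tau_j \tau_k| \leq \frac{n^2}{\epsilon^{5/2} (n - \epsilon)(n\epsilon - 2)(n - 1)} + \frac{3}{2 \epsilon n}$ provided $j \neq k$ and

$|\mathbf{E} \tau_j^2 - 1| \leq \frac{n^2}{\epsilon^{5/2} (n - \epsilon)(n\epsilon - 2)(n - 1)} + \frac{3}{2 \epsilon n}$ for j = 1, . . . , n.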



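Proof. Clearly, $\psi$ is positive definite. Let

$v = \left( \sqrt{\sigma_1}, \ldots, \sqrt{\sigma_n} \right)$.

Then v is an eigenvector of $\psi$ with eigenvalue 1. Indeed, the gradient of $\psi$ at t = v
is 2v:

$\frac{\partial}{\partial \tau_j} \psi \Big|_{t = v} = \frac{1}{\sqrt{\sigma_j}} \sum_{k:\ k \neq j} 2 \xi_{\{j,k\}} = 2 \sqrt{\sigma_j}$.

Let

$L = v^{\perp} \subset \mathbb{R}^n$

be the orthogonal complement to v. Hence L is defined in $\mathbb{R}^n = \{ (\tau_1, \ldots, \tau_n) \}$ by
the equation

$\sum_{j=1}^{n} \tau_j \sqrt{\sigma_j} = 0$.

We write

$\psi(t) = \frac{1}{2} \sum_{j=1}^{n} \tau_j^2 + \sum_{\{j,k\}} \frac{\xi_{\{j,k\}}}{\sqrt{\sigma_j \sigma_k}} \tau_j \tau_k$.

We remark that

(4.2.1) $\alpha(n - 1) \leq \sigma_j \leq \beta(n - 1)$ for j = 1, . . . , n.

Let

$\omega = \sum_{j=1}^{n} \sigma_j \geq \alpha n (n - 1)$

and let us define the quadratic form $\phi : \mathbb{R}^n \to \mathbb{R}$ by

$\phi(t) = \frac{1}{\omega} \Big( \sum_{j=1}^{n} \tau_j \sqrt{\sigma_j} \Big)^2$ for $t = (\tau_1, \ldots, \tau_n)$.

Hence $\phi(t)$ is a form of rank 1 and v is an eigenvector of $\phi$ with eigenvalue 1.
We define a perturbation

$\tilde{\psi} = \psi - \frac{\epsilon^2}{2} \phi$ for $\epsilon = \frac{\alpha}{\beta}$.

Hence $\tilde{\psi}$ is a positive definite quadratic form such that

$\tilde{\psi}(t) = \psi(t)$ for all $t \in L$,

and v is an eigenvector of $\tilde{\psi}$ with eigenvalue $1 - \epsilon^2/2$.

Let us consider the Gaussian probability measure on $\mathbb{R}^n$ with density
proportional to $e^{-\tilde{\psi}}$. Our immediate goal is to estimate the covariances $\mathbf{E} \tau_j \tau_k$ with respect
to that measure.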

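Denoting by $\langle \cdot, \cdot \rangle$ the standard scalar product in $\mathbb{R}^n$, we can write

$\tilde{\psi}(t) = \frac{1}{2} \langle (I + Q) t, t \rangle$,

where I is the n x n identity matrix and $Q = (q_{jk})$ is an n x n symmetric matrix
such that v is an eigenvector of Q with eigenvalue $1 - \epsilon^2$. We have

$q_{jk} = \frac{\xi_{\{j,k\}}}{\sqrt{\sigma_j \sigma_k}} - \epsilon^2 \frac{\sqrt{\sigma_j \sigma_k}}{\omega}$ for $j \neq k$ and

$q_{jj} = -\epsilon^2 \frac{\sigma_j}{\omega}$ for j = 1, . . . , n.

It follows by (4.2.1) that

$\frac{1}{\epsilon(n - 1)} \geq q_{jk} \geq 0$ for $j \neq k$ and

$0 \geq q_{jj} \geq -\frac{\epsilon}{n}$ for j = 1, . . . , n.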

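The covariance matrix $R = \left( \mathbf{E}(\tau_j \tau_k; \tilde{\psi}) \right)$ of the Gaussian measure with density
proportional to $e^{-\tilde{\psi}}$ is $R = (I + Q)^{-1}$. Let

$P = \left( 1 - \frac{\epsilon}{n} \right)^{-1} \left( \frac{\epsilon}{n} I + Q \right)$.

Hence $P = (p_{jk})$ is a symmetric matrix such that

(4.2.2) $0 \leq p_{jk} \leq \left( 1 - \frac{\epsilon}{n} \right)^{-1} \frac{1}{\epsilon(n - 1)}$ for all j, k.

Furthermore, v is an eigenvector of P with eigenvalue $(1 - \epsilon^2 + \epsilon/n)/(1 - \epsilon/n)$, so

(4.2.3) $Pv = \lambda v$ for $\lambda = \left( 1 - \frac{\epsilon}{n} \right)^{-1} \left( 1 - \epsilon^2 + \frac{\epsilon}{n} \right)$.

Let us bound the entries of a positive integer power $P^d = (p^{(d)}_{jk})$ of P. Let

$\kappa = \left( 1 - \frac{\epsilon}{n} \right)^{-1} \frac{\beta}{\alpha^{3/2} (n - 1)^{3/2}}$ and let $y = \kappa v$, $y = (\eta_1, \ldots, \eta_n)$.

By (4.2.1) and (4.2.2), we have

(4.2.4) $p_{jk} \leq \eta_j$ for all j, k.
Also, by (4.2.1) we have

(4.2.5) $\eta_j \leq \left( 1 - \frac{\epsilon}{n} \right)^{-1} \frac{1}{\epsilon^{3/2} (n - 1)}$ for all j.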
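Furthermore, y is an eigenvector of P with eigenvalue $\lambda$ defined by (4.2.3), and hence
y is an eigenvector of the d-th power $P^d = (p^{(d)}_{jk})$ with eigenvalue $\lambda^d$. Combining
this with (4.2.4) and (4.2.5), for $d \geq 0$ we obtain

$p^{(d+1)}_{jk} = \sum_{m=1}^{n} p^{(d)}_{jm} p_{mk} \leq \sum_{m=1}^{n} p^{(d)}_{jm} \eta_m = \lambda^d \eta_j \leq \lambda^d \left( 1 - \frac{\epsilon}{n} \right)^{-1} \frac{1}{\epsilon^{3/2} (n - 1)}$.

We note that for $n > 2/\epsilon$ we have $0 < \lambda < 1$. Consequently, the series

$(I + P)^{-1} = I + \sum_{d=1}^{\infty} (-1)^d P^d$

converges absolutely and we can bound the entries of the matrix

$R = (I + Q)^{-1} = \left( 1 - \frac{\epsilon}{n} \right)^{-1} (I + P)^{-1}$,

$R = (r_{jk})$ by

$|r_{jk}| \leq \left( 1 - \frac{\epsilon}{n} \right)^{-2} \frac{1}{\epsilon^{3/2} (n - 1)} \cdot \frac{1}{1 - \lambda}$ if $j \neq k$ and

$|r_{jj} - 1| \leq \left( 1 - \frac{\epsilon}{n} \right)^{-2} \frac{1}{\epsilon^{3/2} (n - 1)} \cdot \frac{1}{1 - \lambda}$ for j = 1, . . . , n.

We have

$\frac{1}{1 - \lambda} = \left( 1 - \frac{\epsilon}{n} \right) \left( \epsilon^2 - \frac{2\epsilon}{n} \right)^{-1}$.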

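Since R is the covariance matrix of the Gaussian probability measure with density
proportional to $e^{-\tilde{\psi}}$, we obtain

(4.2.6) $|\mathbf{E}(\tau_j \tau_k; \tilde{\psi})| \leq \frac{n^2}{\epsilon^{5/2} (n - \epsilon)(n\epsilon - 2)(n - 1)}$ provided $j \neq k$ and

$|\mathbf{E}(\tau_j^2; \tilde{\psi}) - 1| \leq \frac{n^2}{\epsilon^{5/2} (n - \epsilon)(n\epsilon - 2)(n - 1)}$ for j = 1, . . . , n.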



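Now we go back to the form $\psi$ and the Gaussian probability measure with density
proportional to $e^{-\psi}$. Since v is an eigenvector of both $\psi$ and $\tilde{\psi}$, since $L = v^{\perp}$ and
since $\psi$ and $\tilde{\psi}$ coincide on L, for any linear functions $\ell_1, \ell_2 : \mathbb{R}^n \to \mathbb{R}$, we have

(4.2.7) $\mathbf{E}(\ell_1 \ell_2; \psi) = \mathbf{E}(\ell_1 \ell_2; \psi|L) + \mathbf{E}(\ell_1 \ell_2; \psi|\mathrm{span}(v))$
$= \mathbf{E}(\ell_1 \ell_2; \tilde{\psi}|L) + \mathbf{E}(\ell_1 \ell_2; \psi|\mathrm{span}(v))$
$= \mathbf{E}(\ell_1 \ell_2; \tilde{\psi}) - \mathbf{E}(\ell_1 \ell_2; \tilde{\psi}|\mathrm{span}(v)) + \mathbf{E}(\ell_1 \ell_2; \psi|\mathrm{span}(v))$.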

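We note that the gradient of the coordinate function $\tau_j$ restricted to span(v) is
$\sqrt{\sigma_j/\omega}$. Since v is an eigenvector of $\psi$ with eigenvalue 1 and an eigenvector of $\tilde{\psi}$
with eigenvalue $1 - \epsilon^2/2$, we have

$\mathbf{E}(\tau_j \tau_k; \psi|\mathrm{span}(v)) = \frac{\sqrt{\sigma_j \sigma_k}}{2\omega}$ and

$\mathbf{E}(\tau_j \tau_k; \tilde{\psi}|\mathrm{span}(v)) = \frac{\sqrt{\sigma_j \sigma_k}}{(2 - \epsilon^2)\omega}$ for all j, k.

By (4.2.1) we have

$|\mathbf{E}(\tau_j \tau_k; \psi|\mathrm{span}(v))| \leq \frac{1}{2\epsilon n}$ and $|\mathbf{E}(\tau_j \tau_k; \tilde{\psi}|\mathrm{span}(v))| \leq \frac{1}{\epsilon n}$.

The proof now follows by (4.2.6) and (4.2.7). □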

Now we are ready to prove Theorem 4.1.
Proof of Theorem 4.1. Let us define

$\xi_{\{j,k\}} = \zeta_{\{j,k\}} - \zeta_{\{j,k\}}^2$ for all $j \neq k$

and let us choose $\alpha = \delta - \delta^2$ and $\beta = 1/4$ in Proposition 4.2. We define $\sigma_j$ and $\psi$
as in Proposition 4.2 and consider a linear transformation

$(\tau_1, \ldots, \tau_n) \mapsto \left( \tau_1 \sqrt{\sigma_1}, \ldots, \tau_n \sqrt{\sigma_n} \right)$.

Then the push-forward of the Gaussian probability measure with density
proportional to $e^{-q}$ is the Gaussian probability measure with density proportional to $e^{-\psi}$.
Therefore,

$\mathbf{E}(\tau_j \tau_k; q) = \frac{1}{\sqrt{\sigma_j \sigma_k}} \mathbf{E}(\tau_j \tau_k; \psi)$.

Since

$\sigma_j \geq \alpha(n - 1)$ for j = 1, . . . , n,

the proof follows by Proposition 4.2. □

We will need the following lemma.

(4.3) Lemma. Let $q_0 : \mathbb{R}^n \to \mathbb{R}$, n > 2, be the quadratic form defined by the
formula

$q_0(t) = \frac{1}{2} \sum_{\{j,k\}} (\tau_j + \tau_k)^2$ for $t = (\tau_1, \ldots, \tau_n)$.

Then the eigenspaces of $q_0$ are as follows: the 1-dimensional eigenspace $E_1$ with
eigenvalue n - 1 spanned by the vector u = (1, . . . , 1) and the (n - 1)-dimensional
eigenspace $E_2 = u^{\perp}$ with eigenvalue (n - 2)/2.

Proof. We have

$\frac{\partial q_0}{\partial \tau_j} = (n - 2) \tau_j + \sum_{k=1}^{n} \tau_k$.

Hence the gradient of $q_0(t)$ at t = u is (2n - 2)u, so u is an eigenvector with
eigenvalue (n - 1). For $t \in u^{\perp}$ we have $\tau_1 + \ldots + \tau_n = 0$ and hence

$\frac{\partial q_0}{\partial \tau_j} = (n - 2) \tau_j$.

Therefore, the gradient of $q_0(t)$ at $t \in u^{\perp}$ is (n - 2)t, and so t is an eigenvector with
eigenvalue (n - 2)/2. □
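Lemma 4.3 is easy to confirm numerically for a fixed n. The sketch below builds the symmetric matrix A with $q_0(t) = \langle At, t \rangle$ directly from the definition of $q_0$ and applies it to u and to one vector orthogonal to u. The helper names and the choice n = 6 are ours.

```python
def q0_matrix(n):
    """Symmetric matrix A with q0(t) = <A t, t> for q0(t) = 1/2 * sum (t_j + t_k)^2."""
    a = [[0.0] * n for _ in range(n)]
    for j in range(n):
        for k in range(j + 1, n):
            # 1/2 * (t_j + t_k)^2 = 1/2 t_j^2 + 1/2 t_k^2 + t_j t_k
            a[j][j] += 0.5
            a[k][k] += 0.5
            a[j][k] += 0.5
            a[k][j] += 0.5
    return a


def matvec(a, t):
    """Matrix-vector product A t."""
    return [sum(row[k] * t[k] for k in range(len(t))) for row in a]


n = 6
a = q0_matrix(n)
u = [1.0] * n
t = [1.0, -1.0] + [0.0] * (n - 2)   # a vector orthogonal to u
print(matvec(a, u))                 # should equal (n - 1) * u
print(matvec(a, t))                 # should equal (n - 2) / 2 * t
```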

5. The third degree term 

The main result of this section is the following theorem. 

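(5.1) Theorem. For unordered pairs {j, k}, $1 \leq j \neq k \leq n$, let $u_{\{j,k\}}$ be Gaussian
random variables such that

$\mathbf{E} u_{\{j,k\}} = 0$ for all j, k.

Suppose further that for some $\theta > 0$ we have

$\mathbf{E} u_{\{j,k\}}^2 \leq \frac{\theta}{n}$ for all j, k

and that

$|\mathbf{E} u_{\{j_1,k_1\}} u_{\{j_2,k_2\}}| \leq \frac{\theta}{n^2}$ provided $\{j_1, k_1\} \cap \{j_2, k_2\} = \emptyset$.

Let

$U = \sum_{\{j,k\}} u_{\{j,k\}}^3$.

Then for some constant $\gamma(\theta) > 0$ and any $0 < \epsilon \leq 1/2$ we have

$\left| \mathbf{E} \exp\{ iU \} - \exp\left\{ -\frac{1}{2} \mathbf{E} U^2 \right\} \right| \leq \epsilon$

provided

$n \geq \left( \frac{1}{\epsilon} \right)^{\gamma(\theta)}$.

Furthermore,

$\mathbf{E} U^2 \leq \gamma(\theta)$

for some $\gamma(\theta) > 0$. Here $i = \sqrt{-1}$.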

We apply Theorem 5.1 in the following situation. Let $q : \mathbb{R}^n \to \mathbb{R}$ be the
quadratic form defined by (1.3.1).

Let us consider the Gaussian probability measure on $\mathbb{R}^n$ with density
proportional to $e^{-q}$. We define random variables $u_{\{j,k\}}$ by

$u_{\{j,k\}}(t) = \sqrt[3]{\frac{1}{6} \zeta_{\{j,k\}} \left( 1 - \zeta_{\{j,k\}} \right) \left( 2\zeta_{\{j,k\}} - 1 \right)} \ (\tau_j + \tau_k)$ for $t = (\tau_1, \ldots, \tau_n)$.

Then for the function f(t) defined by (1.3.2) we have

$f = \sum_{\{j,k\}} u_{\{j,k\}}^3$.

In this section, all implied constants in the "O" notation are absolute.
Our main tool is Wick's formula for the expectation of the product of
Gaussian random variables.

(5.2) Wick's formula. Let $w_1, \ldots, w_l$ be Gaussian random variables such that

\[ \mathbf{E}\, w_1 = \ldots = \mathbf{E}\, w_l = 0. \]

Then

\[ \mathbf{E}\,(w_1 \cdots w_l) = 0 \quad \text{if } l \text{ is odd} \]

and

\[ \mathbf{E}\,(w_1 \cdots w_l) = \sum \left( \mathbf{E}\, w_{i_1} w_{i_2} \right) \cdots \left( \mathbf{E}\, w_{i_{l-1}} w_{i_l} \right) \quad \text{if } l = 2r \text{ is even}, \]

where the sum is taken over all $(2r)!/(r!\, 2^r)$ unordered partitions of the set of indices $\{1, \ldots, l\}$ into $r = l/2$ pairwise disjoint unordered pairs $\{i_1, i_2\}, \ldots, \{i_{l-1}, i_l\}$, see, for example, [Zv97]. Such a partition is called a matching of the random variables $w_1, \ldots, w_l$ and we say that $w_i$ and $w_j$ are matched if they form a pair in the matching.

In particular, 

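\[ \mathbf{E}\, w^{2r} = \frac{(2r)!}{r!\, 2^r} \left( \mathbf{E}\, w^2 \right)^r \tag{5.2.1} \]

for a centered Gaussian random variable $w$. We will also use that

\[ \mathbf{E}\, w_1^3 w_2^3 = 9 \left( \mathbf{E}\, w_1^2 \right) \left( \mathbf{E}\, w_2^2 \right) \left( \mathbf{E}\, w_1 w_2 \right) + 6 \left( \mathbf{E}\, w_1 w_2 \right)^3 \tag{5.2.2} \]

and later in Section 6 that

\[ \operatorname{cov}\left( w_1^4, w_2^4 \right) = \mathbf{E}\left( w_1^4 w_2^4 \right) - \left( \mathbf{E}\, w_1^4 \right) \left( \mathbf{E}\, w_2^4 \right) = 72 \left( \mathbf{E}\, w_1 w_2 \right)^2 \left( \mathbf{E}\, w_1^2 \right) \left( \mathbf{E}\, w_2^2 \right) + 24 \left( \mathbf{E}\, w_1 w_2 \right)^4. \tag{5.2.3} \]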
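The coefficients 9, 6, 72 and 24 in the identities above come from counting matchings in Wick's formula. The following sketch enumerates all perfect matchings by brute force and evaluates the resulting moments for a sample covariance matrix; the function names and the particular covariance values are illustrative assumptions, not part of the paper.

```python
def matchings(items):
    """Yield every perfect matching of the list `items` as a list of pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for m in matchings(remaining):
            yield [(first, partner)] + m


def wick_moment(labels, cov):
    """E(w_{labels[0]} * ... * w_{labels[-1]}) for centered jointly Gaussian w's.

    `labels` lists variable indices with repetition; `cov[a][b]` is E(w_a w_b).
    """
    if len(labels) % 2:
        return 0  # odd products have zero expectation
    total = 0
    for m in matchings(list(range(len(labels)))):
        term = 1
        for a, b in m:
            term *= cov[labels[a]][labels[b]]
        total += term
    return total


# Sample covariances: E w1^2 = 2, E w2^2 = 3, E w1 w2 = 1 (positive definite).
a, b, c = 2, 3, 1
cov = [[a, c], [c, b]]

third = wick_moment([0, 0, 0, 1, 1, 1], cov)           # E w1^3 w2^3
fourth = wick_moment([0, 0, 0, 0, 1, 1, 1, 1], cov)    # E w1^4 w2^4
cov44 = fourth - wick_moment([0] * 4, cov) * wick_moment([1] * 4, cov)
```

With these values, `third` agrees with $9abc + 6c^3$ and `cov44` with $72abc^2 + 24c^4$, where $a$, $b$, $c$ denote the three covariances.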
(5.3) Representing monomials by graphs. Let $x_{\{j,k\}}$ : $1 \le j \ne k \le n$ be formal commuting variables. We interpret a monomial in $x_{\{j,k\}}$ as a weighted graph as follows. Let $K_n$ be the complete graph with vertices $1, \ldots, n$ and edges $\{j,k\}$ for $1 \le j \ne k \le n$. A weighted graph $G$ is a set of edges $\{j,k\}$ of $K_n$ with positive integer weights $\alpha_{\{j,k\}}$ on them. The set of vertices of $G$ consists of all vertices of the edges of $G$. With $G$, we associate a monomial

\[ m_G(x) = \prod_{\{j,k\} \in G} x_{\{j,k\}}^{\alpha_{\{j,k\}}}. \]

The weight of $G$ is the degree of $m_G(x)$, that is, $\sum_{\{j,k\} \in G} \alpha_{\{j,k\}}$. We observe that for any $p$ there are not more than $r^{O(r)} n^p$ distinct weighted graphs $G$ of weight $2r$ on $p$ vertices.

In what follows, given a set of random variables, we construct auxiliary Gaussian 
random variables with the same matrix of covariances. This is always possible since 
the matrix of covariances is positive semi-definite. 

Our proof of Theorem 5.1 is based on the following combinatorial lemma. 

(5.4) Lemma. For the Gaussian random variables $u_{\{j,k\}}$ of Theorem 5.1, let us introduce auxiliary Gaussian random variables $v_{\{j,k\}}$ such that

\[ \mathbf{E}\, v_{\{j,k\}} = 0 \quad \text{for all } 1 \le j \ne k \le n \]

and

\[ \mathbf{E}\, v_{\{j_1,k_1\}} v_{\{j_2,k_2\}} = \mathbf{E}\, u_{\{j_1,k_1\}}^3 u_{\{j_2,k_2\}}^3. \]

Given a weighted graph $G$ of weight $2r$, $r \ge 1$, let us represent it as a vertex-disjoint union

\[ G = G_0 \cup G_1, \]

where $G_0$ consists of $s$ isolated edges of weight 1 each and $G_1$ is a graph with no isolated edges of weight 1 (we may have $s = 0$ and $G_0$ empty).
Then

(1) We have

\[ \left| \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) \right|, \ \left| \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) \right| \le \frac{r^{O(r)} \theta^{3r}}{n^{3r + s/2}} \]

if $s$ is even and

\[ \left| \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) \right|, \ \left| \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) \right| \le \frac{r^{O(r)} \theta^{3r}}{n^{3r + (s+1)/2}} \]

if $s$ is odd.

(2) If $s$ is even and $G_1$ is a vertex-disjoint union of $r - s/2$ connected components, each consisting of a pair of edges of weight 1 sharing exactly one common vertex, then

\[ \left| \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) - \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) \right| \le \frac{r^{O(r)} \theta^{3r}}{n^{3r + s/2 + 1}}. \]

(3) If $s$ is even and $G_1$ is a vertex-disjoint union of $r - s/2$ connected components, each consisting of a pair of edges of weight 1 sharing exactly one common vertex, then $G$ has precisely $3r + s/2$ vertices. In all other cases, $G$ has strictly fewer than $3r + s/2$ vertices.

Proof. If $\{j_1,k_1\} \cap \{j_2,k_2\} = \emptyset$ we say that the pair of variables $u_{\{j_1,k_1\}}, u_{\{j_2,k_2\}}$ and the pair of variables $v_{\{j_1,k_1\}}, v_{\{j_2,k_2\}}$ are weakly correlated. If $\{j_1,k_1\} \cap \{j_2,k_2\} \ne \emptyset$ we say that the pairs of variables are strongly correlated. Pairs of variables indexed by edges in different connected components of $G$ are necessarily weakly correlated.

To prove Part (1) we use Wick's formula of Section 5.2. By (5.2.2), we obtain

\[ \mathbf{E}\left( v_{\{j_1,k_1\}} v_{\{j_2,k_2\}} \right) = O\left( \frac{\theta^3}{n^4} \right) \ \text{if the pair } v_{\{j_1,k_1\}}, v_{\{j_2,k_2\}} \text{ is weakly correlated}, \qquad \mathbf{E}\left( v_{\{j_1,k_1\}} v_{\{j_2,k_2\}} \right) = O\left( \frac{\theta^3}{n^3} \right) \ \text{if the pair is strongly correlated}. \tag{5.4.1} \]

Since for each isolated edge $\{j_1,k_1\} \in G_0$ variable $v_{\{j_1,k_1\}}$ has to be matched with variable $v_{\{j_2,k_2\}}$ indexed by an edge $\{j_2,k_2\}$ in a different connected component, we conclude that every matching of the set

\[ \left\{ v_{\{j,k\}} : \{j,k\} \in G \right\} \tag{5.4.2} \]

contains at least $s/2$ weakly correlated pairs and hence

\[ \left| \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) \right| \le r^{O(r)} \left( \frac{\theta^3}{n^4} \right)^{s/2} \left( \frac{\theta^3}{n^3} \right)^{r - s/2}. \]

Moreover, if $s$ is odd, then the number of weakly correlated pairs is at least $(s+1)/2$ and hence

\[ \left| \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) \right| \le r^{O(r)} \left( \frac{\theta^3}{n^4} \right)^{(s+1)/2} \left( \frac{\theta^3}{n^3} \right)^{r - (s+1)/2}. \]

Similarly, since for each isolated edge $\{j_1,k_1\} \in G_0$ at least one copy of the variable $u_{\{j_1,k_1\}}$ has to be matched with a copy of variable $u_{\{j_2,k_2\}}$ indexed by an edge in a different connected component, we conclude that every matching of the multiset

\[ \left\{ u_{\{j,k\}}, u_{\{j,k\}}, u_{\{j,k\}} : \{j,k\} \in G \right\} \tag{5.4.3} \]

contains at least $s/2$ weakly correlated pairs, and hence

\[ \left| \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) \right| \le r^{O(r)} \left( \frac{\theta}{n^2} \right)^{s/2} \left( \frac{\theta}{n} \right)^{3r - s/2}. \]

Moreover, if $s$ is odd, then the number of weakly correlated pairs is at least $(s+1)/2$ and hence

\[ \left| \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) \right| \le r^{O(r)} \left( \frac{\theta}{n^2} \right)^{(s+1)/2} \left( \frac{\theta}{n} \right)^{3r - (s+1)/2}. \]

This concludes the proof of Part (1).

To prove Part (2), let us define $\Sigma_v(G)$ as the sum in Wick's formula for $\mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right)$ taken over all matchings of the set (5.4.2) of the following structure: we split the edges of $G$ into $r$ pairs, pairing each isolated edge with another isolated edge and pairing each edge in a connected component of $G$ consisting of two edges with the remaining edge in the same connected component. Then we match every variable $v_{\{j_1,k_1\}}$ with the variable $v_{\{j_2,k_2\}}$ such that $\{j_1,k_1\}$ is paired with $\{j_2,k_2\}$. Reasoning as in the proof of Part (1), we conclude that

\[ \left| \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) - \Sigma_v(G) \right| \le \frac{r^{O(r)} \theta^{3r}}{n^{3r + s/2 + 1}}, \]

since every matching of the set (5.4.2) which is not included in $\Sigma_v(G)$ contains at least $s/2 + 1$ weakly correlated pairs.

Similarly, let us define $\Sigma_u(G)$ as the sum in Wick's formula for $\mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right)$ taken over all matchings of the multiset (5.4.3) of the following structure: we split the edges of $G$ into $r$ pairs as above and match every copy of variable $u_{\{j_1,k_1\}}$ with a copy of variable $u_{\{j_2,k_2\}}$ indexed by an edge in the same pair (in particular, we may match copies of the same variable). Reasoning as in the proof of Part (1), we conclude that

\[ \left| \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) - \Sigma_u(G) \right| \le \frac{r^{O(r)} \theta^{3r}}{n^{3r + s/2 + 1}}, \]

since every matching of the multiset (5.4.3) which is not included in $\Sigma_u(G)$ contains at least $s/2 + 1$ weakly correlated pairs.

The proof of Part (2) follows since

\[ \Sigma_u(G) = \Sigma_v(G) \]

by Wick's formula.

To prove Part (3), we note that a connected weighted graph $G$ of weight $e$ contains a spanning tree with at most $e$ edges and hence has at most $e + 1$ vertices. In particular, a connected graph $G$ of weight $e$ contains fewer than $3e/2$ vertices unless $G$ is an isolated edge of weight 1 or a pair of edges of weight 1 each, sharing one common vertex. Therefore, $G$ has at most

\[ 2s + \frac{3}{2}(2r - s) = 3r + \frac{s}{2} \]

vertices and strictly fewer vertices, unless $s$ is even and the connected components of $G_1$ are pairs of edges of weight 1 each sharing one common vertex. □
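As a small sanity check of the vertex count in Part (3), the sketch below builds the extremal configuration (isolated edges plus vertex-disjoint paths of two edges, all of weight 1) and counts its vertices. The helper names are ours; the example only illustrates the count $3r + s/2$ and is not part of the proof.

```python
def extremal_graph(s, cherries):
    """Edge list of s isolated edges and `cherries` two-edge paths, all vertex-disjoint.

    Every edge has weight 1, so the weight of the graph equals the number of edges.
    """
    edges, nxt = [], 0
    for _ in range(s):
        edges.append((nxt, nxt + 1))
        nxt += 2
    for _ in range(cherries):
        edges.append((nxt, nxt + 1))
        edges.append((nxt + 1, nxt + 2))  # shares exactly one vertex with the previous edge
        nxt += 3
    return edges


def vertex_count(edges):
    """Number of distinct endpoints of the given edges."""
    return len({v for e in edges for v in e})


s, cherries = 4, 3
g = extremal_graph(s, cherries)
r = len(g) // 2                      # weight 2r = s + 2 * cherries
extremal_vertices = vertex_count(g)  # equals 3r + s/2 for this configuration
```

Replacing any two-edge path by a single edge of weight 2 keeps the weight but removes a vertex, which is the sense in which all other graphs of weight $2r$ have strictly fewer vertices.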
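(5.5) Proof of Theorem 5.1. Let $v_{\{j,k\}}$ be Gaussian random variables defined in Lemma 5.4 and let

\[ V = \sum_{\{j,k\}} v_{\{j,k\}}. \]

Since there are $O(n^3)$ strongly correlated pairs $v_{\{j_1,k_1\}}, v_{\{j_2,k_2\}}$ and there are $O(n^4)$ weakly correlated pairs, by (5.4.1) we have

\[ \mathbf{E}\, V^2 = \mathbf{E}\, U^2 = O\left( \theta^3 \right). \tag{5.5.1} \]

Since $V$ is a Gaussian random variable, we have

\[ \mathbf{E}\, e^{iV} = \exp\left\{ -\frac{1}{2} \mathbf{E}\, V^2 \right\} = \exp\left\{ -\frac{1}{2} \mathbf{E}\, U^2 \right\}. \tag{5.5.2} \]

Our goal is to show that $\mathbf{E}\, e^{iV}$ and $\mathbf{E}\, e^{iU}$ are asymptotically equal as $n \to +\infty$. By symmetry, the odd moments of $U$ and $V$ are 0:

\[ \mathbf{E}\, U^k = \mathbf{E}\, V^k = 0 \quad \text{if } k > 0 \text{ is odd}. \tag{5.5.3} \]

The even moments of $U$ and $V$ can be expressed as

\[ \mathbf{E}\, U^{2r} = \sum_G a_G\, \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) \quad \text{and} \quad \mathbf{E}\, V^{2r} = \sum_G a_G\, \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right), \]

where the sum is taken over all weighted graphs $G$ of weight $2r$ and

\[ 1 \le a_G \le (2r)!. \]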

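Let $\mathcal{G}_{2r}$ be the set of weighted graphs $G$ of weight $2r$ whose connected components are an even number $s$ of isolated edges of weight 1 and $r - s/2$ pairs of edges of weight 1 sharing one common vertex. Since there are no more than $r^{O(r)} n^p$ distinct weighted graphs of weight $2r$ with $p$ vertices, by Parts (1) and (3) of Lemma 5.4, we have

\[ \left| \mathbf{E}\, U^{2r} - \sum_{G \in \mathcal{G}_{2r}} a_G\, \mathbf{E}\, m_G\left( u_{\{j,k\}}^3 : 1 \le j \ne k \le n \right) \right| \le \frac{r^{O(r)} \theta^{3r}}{n} \]

and

\[ \left| \mathbf{E}\, V^{2r} - \sum_{G \in \mathcal{G}_{2r}} a_G\, \mathbf{E}\, m_G\left( v_{\{j,k\}} : 1 \le j \ne k \le n \right) \right| \le \frac{r^{O(r)} \theta^{3r}}{n}. \]

Therefore, by Part (2) of Lemma 5.4,

\[ \left| \mathbf{E}\, U^{2r} - \mathbf{E}\, V^{2r} \right| \le \frac{r^{O(r)} \theta^{3r}}{n}. \tag{5.5.4} \]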

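From Taylor's Theorem

\[ \left| e^{ix} - \sum_{s=0}^{2r-1} \frac{(ix)^s}{s!} \right| \le \frac{x^{2r}}{(2r)!} \quad \text{for } x \in \mathbb{R}, \]

it follows that

\[ \left| \mathbf{E}\, e^{iU} - \mathbf{E}\, e^{iV} \right| \le \frac{\mathbf{E}\, U^{2r}}{(2r)!} + \frac{\mathbf{E}\, V^{2r}}{(2r)!} + \sum_{s=0}^{2r-1} \frac{\left| \mathbf{E}\, U^s - \mathbf{E}\, V^s \right|}{s!}. \]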

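From (5.5.1) and (5.2.1) we deduce that for a positive integer $r$ we have

\[ \mathbf{E}\, V^{2r} \le \frac{(2r)!}{r!}\, 2^{O(r)} \theta^{3r}. \]

Therefore, by (5.5.4)

\[ \left| \mathbf{E}\, e^{iU} - \mathbf{E}\, e^{iV} \right| \le \frac{2^{O(r)} \theta^{3r}}{r!} + \frac{r^{O(r)} \theta^{3r}}{n}. \tag{5.5.5} \]

Given $0 < \epsilon < 1/2$, one can choose a positive integer $r$ such that

\[ r \ln r = O\left( \ln \frac{1}{\epsilon} \right) \]

so that the first term in the right hand side of (5.5.5) does not exceed $\epsilon/2$. It follows then that for all

\[ n \ge \left( \frac{1}{\epsilon} \right)^{\gamma(\theta)} \]

we have

\[ \left| \mathbf{E}\, e^{iU} - \mathbf{E}\, e^{iV} \right| \le \epsilon, \]

and the proof follows by (5.5.2). □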

6. The fourth degree term 

The main result of this section is the following theorem. 

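(6.1) Theorem. For unordered pairs $\{j,k\}$, $1 \le j \ne k \le n$, let $w_{\{j,k\}}$ be Gaussian random variables such that

\[ \mathbf{E}\, w_{\{j,k\}} = 0 \quad \text{for all } j, k, \]

and let $\sigma_{\{j,k\}} \in \{-1, 1\}$ be numbers.
Suppose further that for some $\theta > 0$ we have

\[ \mathbf{E}\, w_{\{j,k\}}^2 \le \frac{\theta}{n} \quad \text{for all } j, k, \]

and that

\[ \left| \mathbf{E}\, w_{\{j_1,k_1\}} w_{\{j_2,k_2\}} \right| \le \frac{\theta}{n^2} \quad \text{provided } \{j_1,k_1\} \cap \{j_2,k_2\} = \emptyset. \]

Let

\[ W = \sum_{\{j,k\}} \sigma_{\{j,k\}} w_{\{j,k\}}^4. \]

Then for some constant $\gamma(\theta) > 0$ we have

(1) \[ \mathbf{E}\, |W| \le \gamma(\theta); \]

(2) \[ \operatorname{var} W \le \frac{\gamma(\theta)}{n}; \]

(3) \[ \mathbf{P}\left\{ |W| > \gamma(\theta) \right\} \le \exp\left\{ -n^{1/5} \right\} \]

provided $n \ge \gamma_1(\theta)$ for some constant $\gamma_1(\theta) > 0$.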

We apply Theorem 6.1 in the following situation. Let $q : \mathbb{R}^n \to \mathbb{R}$ be the quadratic form defined by (1.3.1). Let us consider the Gaussian probability measure on $\mathbb{R}^n$ with density proportional to $e^{-q}$. We define random variables $w_{\{j,k\}}$ by

\[ w_{\{j,k\}}(t) = \sqrt[4]{\tfrac{1}{24}\, \zeta_{\{j,k\}} \left(1 - \zeta_{\{j,k\}}\right) \left| 6\zeta_{\{j,k\}}^2 - 6\zeta_{\{j,k\}} + 1 \right|}\; (\tau_j + \tau_k) \quad \text{for } t = (\tau_1, \ldots, \tau_n) \]

and let

\[ \sigma_{\{j,k\}} = \operatorname{sign}\left( 6\zeta_{\{j,k\}}^2 - 6\zeta_{\{j,k\}} + 1 \right). \]

Then for the function $h$ defined by (1.3.2), we have

\[ h = \sum_{\{j,k\}} \sigma_{\{j,k\}} w_{\{j,k\}}^4. \]

While the proof of Parts (1)-(2) is done by a straightforward computation, to prove Part (3) we need reverse Hölder inequalities for polynomials with respect to the Gaussian measure.


(6.2) Lemma. Let $p$ be a polynomial of degree $d$ in random Gaussian variables $w_1, \ldots, w_l$. Then for $r \ge 2$ we have

\[ \left( \mathbf{E}\, |p|^r \right)^{1/r} \le r^{d/2} \left( \mathbf{E}\, p^2 \right)^{1/2}. \]

Proof. This is Corollary 5 of [Du87]. □
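For intuition, the inequality of Lemma 6.2 can be checked in the simplest case of a monomial $p = w^d$ in a single standard Gaussian variable $w$, where the moments are given by double factorials as in (5.2.1). This is only a sketch of one special case, with helper names of our own choosing; it does not replace the reference to [Du87].

```python
import math


def double_factorial(m):
    """m!! = m (m-2) (m-4) ... for odd m >= 1, with the convention (-1)!! = 1."""
    result = 1
    while m > 1:
        result *= m
        m -= 2
    return result


def even_moment(k):
    """E w^k for a standard Gaussian w and even k >= 0, namely (k-1)!!."""
    return double_factorial(k - 1)


def lemma_holds(d, r):
    """Check (E|p|^r)^(1/r) <= r^(d/2) (E p^2)^(1/2) for p = w^d and even r."""
    # r even makes |p|^r = w^(d r) with d*r even, so the exact moment is available.
    lhs = even_moment(d * r) ** (1.0 / r)
    rhs = r ** (d / 2) * math.sqrt(even_moment(2 * d))
    return lhs <= rhs
```

For instance, with $d = 4$ (the degree used in Section 6.3) the check passes for every even $r$ between 2 and 12.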

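(6.3) Proof of Theorem 6.1. All implied constants in the "O" notation below are absolute.

By formula (5.2.1),

\[ \mathbf{E}\, w_{\{j,k\}}^4 = 3 \left( \mathbf{E}\, w_{\{j,k\}}^2 \right)^2 = O\left( \frac{\theta^2}{n^2} \right) \]

and hence

\[ \mathbf{E}\, |W| = O\left( \theta^2 \right) \]

and Part (1) follows. Furthermore,

\[ \operatorname{var} W = \sum_{\{j_1,k_1\},\, \{j_2,k_2\}} \sigma_{\{j_1,k_1\}} \sigma_{\{j_2,k_2\}} \operatorname{cov}\left( w_{\{j_1,k_1\}}^4, w_{\{j_2,k_2\}}^4 \right). \]

By (5.2.3) we have

\[ \operatorname{cov}\left( w_{\{j_1,k_1\}}^4, w_{\{j_2,k_2\}}^4 \right) = O\left( \frac{\theta^4}{n^4} \right) \]

and, additionally,

\[ \operatorname{cov}\left( w_{\{j_1,k_1\}}^4, w_{\{j_2,k_2\}}^4 \right) = O\left( \frac{\theta^4}{n^6} \right) \quad \text{provided } \{j_1,k_1\} \cap \{j_2,k_2\} = \emptyset. \]

Therefore,

\[ \operatorname{var} W = O\left( \frac{\theta^4}{n} \right), \tag{6.3.1} \]

which proves Part (2).

Finally, applying Lemma 6.2 with $d = 4$ we deduce from (6.3.1) that for any $r \ge 2$

\[ \mathbf{E}\, |W - \mathbf{E}\, W|^r \le r^{2r} n^{-r/2} 2^{O(r)} \theta^{2r}. \]

Choosing $r = n^{1/5}$, we conclude that for all sufficiently large $n \ge n_0(\theta)$ we have

\[ \mathbf{E}\, |W - \mathbf{E}\, W|^r \le \exp\left\{ -n^{1/5} \right\}. \]

By Markov's inequality, we obtain

\[ \mathbf{P}\left\{ |W - \mathbf{E}\, W| > 1 \right\} \le \exp\left\{ -n^{1/5} \right\} \]

for all sufficiently large $n$ and the proof follows from Part (1). □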
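The role of the choice $r = n^{1/5}$ can be made explicit by the following short computation. It is a sketch in which $C(\theta)$ is our notation for a constant absorbing the factor $2^{O(r)} \theta^{2r}$, so that this factor is at most $\exp\{C(\theta)\, r\}$.

```latex
\begin{align*}
r^{2r}\, n^{-r/2}\, 2^{O(r)}\, \theta^{2r}
  &\le \exp\Bigl\{ 2r \ln r - \tfrac{r}{2} \ln n + C(\theta)\, r \Bigr\} \\
  &= \exp\Bigl\{ \bigl(\tfrac{2}{5} - \tfrac{1}{2}\bigr)\, n^{1/5} \ln n + C(\theta)\, n^{1/5} \Bigr\}
     \qquad \bigl(r = n^{1/5},\ \ln r = \tfrac{1}{5} \ln n\bigr) \\
  &= \exp\Bigl\{ -n^{1/5} \bigl( \tfrac{1}{10} \ln n - C(\theta) \bigr) \Bigr\}
   \;\le\; \exp\bigl\{ -n^{1/5} \bigr\}
   \qquad \text{once } \ln n \ge 10\,\bigl(1 + C(\theta)\bigr).
\end{align*}
```

Any exponent $r = n^{c}$ with $0 < c < 1/4$ would make the coefficient $2c - 1/2$ of $r \ln n$ negative; the value $c = 1/5$ is the one used in the text.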
7. Computing the integral over a neighborhood of the origin

We consider the integral

\[ \int_{\Pi} F(t)\, dt \]

of Corollary 3.3. Hence

\[ F(t) = \exp\left\{ -i \sum_{m=1}^n d_m \tau_m \right\} \prod_{\{j,k\}} \left( 1 - \zeta_{\{j,k\}} + \zeta_{\{j,k\}} e^{i(\tau_j + \tau_k)} \right) \quad \text{for } t = (\tau_1, \ldots, \tau_n), \]

where $D = (d_1, \ldots, d_n)$ is a given degree sequence, $z = \left( \zeta_{\{j,k\}} \right)$ is the maximum entropy matrix and $\Pi$ is the parallelepiped $[-\pi, \pi]^n$. We recall that the quadratic form $q : \mathbb{R}^n \to \mathbb{R}$ is defined by

\[ q(t) = \frac{1}{2} \sum_{\{j,k\}} \left( \zeta_{\{j,k\}} - \zeta_{\{j,k\}}^2 \right) (\tau_j + \tau_k)^2 \quad \text{for } t = (\tau_1, \ldots, \tau_n). \]

In this section, we prove the following main result. 

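(7.1) Theorem. Let us fix a number $0 < \delta < 1/2$ and suppose that

\[ \delta \le \zeta_{\{j,k\}} \le 1 - \delta \quad \text{for all } j \ne k. \]

Let $f, h : \mathbb{R}^n \to \mathbb{R}$ be polynomials defined by (1.3.2). Let us define a neighborhood of the origin $U \subset \Pi$ by

\[ U = \left\{ (\tau_1, \ldots, \tau_n) : \ |\tau_k| \le \frac{\ln n}{\sqrt{n}} \ \text{for } k = 1, \ldots, n \right\}. \]

Let

\[ \Xi = \int_{\mathbb{R}^n} e^{-q(t)}\, dt \]

and let us consider the Gaussian probability measure in $\mathbb{R}^n$ with density $\Xi^{-1} e^{-q}$.
Let

\[ \mu = \mathbf{E}\, f^2 \quad \text{and} \quad \nu = \mathbf{E}\, h. \]

Then

(1) \[ \Xi \ge \left( \frac{4\pi}{n} \right)^{n/2}; \]

(2) \[ \mu, \ |\nu|, \ \mathbf{E}\, |h| \le \gamma(\delta) \]

for some constant $\gamma(\delta) > 0$;

(3) For any $0 < \epsilon < 1/2$

\[ \left| \int_U |F(t)|\, dt - \exp\{\nu\}\, \Xi \right| \le \epsilon\, \Xi \]

provided

\[ n \ge \left( \frac{1}{\epsilon} \right)^{\gamma(\delta)} \]

for some constant $\gamma(\delta) > 0$;

(4) For any $0 < \epsilon < 1/2$

\[ \left| \int_U F(t)\, dt - \exp\left\{ -\frac{\mu}{2} + \nu \right\} \Xi \right| \le \epsilon\, \Xi \]

provided

\[ n \ge \left( \frac{1}{\epsilon} \right)^{\gamma(\delta)} \]

for some $\gamma(\delta) > 0$.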

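Proof.

In what follows, all constants implied in the "O" and "Ω" notation depend only on the parameter $\delta$.
Let

\[ q_0(t) = \frac{1}{2} \sum_{\{j,k\}} (\tau_j + \tau_k)^2 \quad \text{for } t = (\tau_1, \ldots, \tau_n) \]

as in Lemma 4.3. Then $q(t) \le \frac{1}{4} q_0(t)$ and hence

\[ \Xi = \int_{\mathbb{R}^n} e^{-q(t)}\, dt \ge \int_{\mathbb{R}^n} e^{-q_0(t)/4}\, dt = \pi^{n/2} \left( \frac{4}{n-1} \right)^{1/2} \left( \frac{8}{n-2} \right)^{\frac{n-1}{2}} \ge \left( \frac{4\pi}{n} \right)^{n/2}, \]

which proves Part (1).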

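Let us think of the coordinate functions $\tau_j$ as random variables with respect to the Gaussian probability measure with density proportional to $e^{-q}$.

By Theorem 4.1, we have

\[ \left| \mathbf{E}\, \tau_j \tau_k \right| = O\left( \frac{1}{n^2} \right) \ \text{provided } j \ne k \quad \text{and} \quad \mathbf{E}\, \tau_j^2 = O\left( \frac{1}{n} \right) \ \text{for } j = 1, \ldots, n. \tag{7.1.1} \]

For an unordered pair $1 \le j \ne k \le n$, let us define

\[ u_{\{j,k\}} = \sqrt[3]{\tfrac{1}{6}\, \zeta_{\{j,k\}} \left(1 - \zeta_{\{j,k\}}\right) \left(2\zeta_{\{j,k\}} - 1\right)}\; (\tau_j + \tau_k). \]

Then

\[ f = \sum_{\{j,k\}} u_{\{j,k\}}^3. \]

Similarly, let us define

\[ w_{\{j,k\}} = \sqrt[4]{\tfrac{1}{24}\, \zeta_{\{j,k\}} \left(1 - \zeta_{\{j,k\}}\right) \left| 6\zeta_{\{j,k\}}^2 - 6\zeta_{\{j,k\}} + 1 \right|}\; (\tau_j + \tau_k) \quad \text{and} \quad \sigma_{\{j,k\}} = \operatorname{sign}\left( 6\zeta_{\{j,k\}}^2 - 6\zeta_{\{j,k\}} + 1 \right). \]

Then

\[ h = \sum_{\{j,k\}} \sigma_{\{j,k\}} w_{\{j,k\}}^4. \]

By (7.1.1) the random variables $u_{\{j,k\}}$ satisfy the conditions of Theorem 5.1 and hence the upper bound for $\mu = \mathbf{E}\, f^2$ follows by Theorem 5.1. Similarly, by (7.1.1) the random variables $w_{\{j,k\}}$ satisfy the conditions of Theorem 6.1 and hence the upper bound for $|\nu| = |\mathbf{E}\, h|$ and $\mathbf{E}\, |h|$ follows by Part (1) of Theorem 6.1. This concludes the proof of Part (2) of the theorem.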

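By Lemma 4.3, eigenvalues of $q$ are $\Omega(n)$ from which it follows that

\[ \int_{\mathbb{R}^n \setminus U} e^{-q(t)}\, dt \le \exp\left\{ -\Omega\left( \ln^2 n \right) \right\} \Xi. \tag{7.1.2} \]

From Theorem 5.1, we have

\[ \left| \int_{\mathbb{R}^n} e^{-q(t) + i f(t)}\, dt - \exp\left\{ -\frac{\mu}{2} \right\} \Xi \right| \le \epsilon\, \Xi \quad \text{provided } n \ge \left( \frac{1}{\epsilon} \right)^{O(1)}, \]

which, combined with (7.1.2), results in

\[ \left| \int_U e^{-q(t) + i f(t)}\, dt - \exp\left\{ -\frac{\mu}{2} \right\} \Xi \right| \le \epsilon\, \Xi \quad \text{provided } n \ge \left( \frac{1}{\epsilon} \right)^{O(1)}. \tag{7.1.3} \]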

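By Part (2) of Theorem 6.1 and Chebyshev's inequality, we have

\[ \mathbf{P}\left\{ |h - \nu| > \epsilon \right\} = O\left( \frac{1}{\epsilon^2 n} \right) \tag{7.1.4} \]

while by Part (3) of Theorem 6.1, we have

\[ \mathbf{P}\left\{ |h| > \gamma(\delta) \right\} = O\left( \exp\left\{ -n^{1/5} \right\} \right) \tag{7.1.5} \]

for some constant $\gamma(\delta) > 0$. In addition,

\[ |h(t)| = O\left( \ln^4 n \right) \quad \text{for } t \in U. \tag{7.1.6} \]

Combining (7.1.4)-(7.1.6) and Part (2) of the theorem, we deduce from (7.1.2) and (7.1.3) that

\[ \left| \int_U e^{-q(t) + h(t)}\, dt - \exp\{\nu\}\, \Xi \right| \le \epsilon\, \Xi \]

and

\[ \left| \int_U e^{-q(t) + i f(t) + h(t)}\, dt - \exp\left\{ -\frac{\mu}{2} + \nu \right\} \Xi \right| \le \epsilon\, \Xi \quad \text{provided } n \ge \left( \frac{1}{\epsilon} \right)^{O(1)}. \tag{7.1.7} \]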

From the Taylor series expansion, cf. (3.2), we obtain

\[ F(t) = \exp\left\{ -q(t) + i f(t) + h(t) + \rho(t) \right\}, \quad \text{where} \quad |\rho(t)| = O\left( \frac{\ln^5 n}{\sqrt{n}} \right) \ \text{for } t \in U. \]

Therefore, for any $\epsilon > 0$ we have

\[ \left| |F(t)| - e^{-q(t) + h(t)} \right| \le \epsilon\, e^{-q(t) + h(t)} \quad \text{and} \quad \left| F(t) - e^{-q(t) + i f(t) + h(t)} \right| \le \epsilon\, e^{-q(t) + h(t)} \quad \text{for all } t \in U \]

provided $n \ge \left( \frac{1}{\epsilon} \right)^{O(1)}$.

The proof of Parts (3) and (4) now follows from (7.1.7) and Part (2).

8. Bounding the integral outside of the special points

We consider the integral representation of Corollary 3.3. Our goal is to show that the integral of $F(t)$ for $t$ outside of the neighborhood of the special points

\[ (0, \ldots, 0) \quad \text{and} \quad (\pm\pi, \ldots, \pm\pi) \]

is asymptotically negligible.

In this section, we prove the following main result. 

(8.1) Theorem. Let us fix a number $0 < \delta < 1/2$ and let $D = (d_1, \ldots, d_n)$ be a $\delta$-tame degree sequence. Let us define subsets $U, W \subset \Pi$ by

\[ U = \left\{ (\tau_1, \ldots, \tau_n) : \ |\tau_j| \le \frac{\ln n}{\sqrt{n}} \ \text{for } j = 1, \ldots, n \right\} \]

and

\[ W = \left\{ (\tau_1, \ldots, \tau_n) : \ |\tau_j - \sigma_j \pi| \le \frac{\ln n}{\sqrt{n}} \ \text{for some } \sigma_j = \pm 1 \text{ and all } j = 1, \ldots, n \right\}. \]

Then for any $\kappa > 0$

\[ \int_{\Pi \setminus (U \cup W)} |F(t)|\, dt \le n^{-\kappa} \int_U |F(t)|\, dt \]

provided $n \ge \gamma(\delta, \kappa)$.

The plan of the proof of Theorem 8.1 is as follows: first, using some combinatorial arguments we show that for any positive constant $\epsilon > 0$ the integral is asymptotically negligible outside of the areas where $|\tau_j| \le \epsilon$ for all $j$ or where $|\tau_j - \sigma_j \pi| \le \epsilon$ for some $\sigma_j = \pm 1$ and all $j$. Then we note that $|F(t)|$ is log-concave for $t$ in a neighborhood of the origin and use a concentration inequality for log-concave measures.

We introduce the following metric $\rho$.

(8.2) Metric $\rho$. Let us define a function $\rho : \mathbb{R} \to [0, \pi]$ as follows:

\[ \rho(x) = \min_{k \in \mathbb{Z}} |x - 2\pi k|. \]

In words: $\rho(x)$ is the distance from $x$ to the nearest integer multiple of $2\pi$. Clearly,

\[ \rho(-x) = \rho(x) \quad \text{and} \quad \rho(x + y) \le \rho(x) + \rho(y) \]

for all $x, y \in \mathbb{R}$.
We will use that

\[ 1 - \frac{1}{2} \rho^2(x) \le \cos x \le 1 - \frac{1}{5} \rho^2(x). \tag{8.2.1} \]
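The function $\rho$ and the two-sided bound on $\cos x$ in terms of $\rho(x)$ (with constants $1/2$ and $1/5$) can be checked numerically. The sketch below is an illustration; the implementation via `math.fmod` is our choice.

```python
import math

TWO_PI = 2 * math.pi


def rho(x):
    """Distance from x to the nearest integer multiple of 2*pi; the value lies in [0, pi]."""
    r = abs(math.fmod(x, TWO_PI))  # r lies in [0, 2*pi)
    return min(r, TWO_PI - r)


def cosine_bounds_hold(x, tol=1e-12):
    """Check 1 - rho(x)^2 / 2 <= cos(x) <= 1 - rho(x)^2 / 5 up to rounding."""
    d = rho(x)
    return 1 - d * d / 2 <= math.cos(x) + tol and math.cos(x) <= 1 - d * d / 5 + tol
```

The upper bound is nearly tight at $\rho(x) = \pi$, where $\cos x = -1$ and $1 - \pi^2/5 \approx -0.974$.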
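(8.3) The absolute value of $F(t)$. Let

\[ \alpha_{\{j,k\}} = 2 \zeta_{\{j,k\}} \left( 1 - \zeta_{\{j,k\}} \right) \quad \text{for all } j \ne k. \]

If $D$ is $\delta$-tame, we have

\[ 2\delta^2 \le \alpha_{\{j,k\}} \le \frac{1}{2} \quad \text{for all } j \ne k. \tag{8.3.1} \]

We have

\[ |F(t)| = \prod_{\{j,k\}} \left( 1 - \alpha_{\{j,k\}} + \alpha_{\{j,k\}} \cos(\tau_j + \tau_k) \right)^{1/2}. \]

For $1 \le j \ne k \le n$ let us define a function of $\tau \in \mathbb{R}$,

\[ f_{\{j,k\}}(\tau) = \sqrt{1 - \alpha_{\{j,k\}} + \alpha_{\{j,k\}} \cos \tau}, \]

so

\[ |F(t)| = \prod_{\{j,k\}} f_{\{j,k\}}(\tau_j + \tau_k). \tag{8.3.2} \]

We note that

\[ f_{\{j,k\}}(0) = 1. \]

It follows by (8.2.1) and (8.3.1) that for $\epsilon > 0$

\[ f_{\{j,k\}}(x) \le \exp\left\{ -\gamma(\delta, \epsilon) \right\} f_{\{j,k\}}(y) \quad \text{provided } \rho(x) \ge 2\epsilon \text{ and } \rho(y) \le \epsilon \tag{8.3.3} \]

for some $\gamma(\delta, \epsilon) > 0$. Furthermore,

\[ \frac{d^2}{d\tau^2} \ln f_{\{j,k\}}(\tau) = -\frac{\alpha_{\{j,k\}} \left( \alpha_{\{j,k\}} + \cos \tau - \alpha_{\{j,k\}} \cos \tau \right)}{2 \left( 1 - \alpha_{\{j,k\}} + \alpha_{\{j,k\}} \cos \tau \right)^2}. \]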

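In particular, by (8.3.1)

\[ \frac{d^2}{d\tau^2} \ln f_{\{j,k\}}(\tau) \le -\frac{\delta^2}{2} \quad \text{for } -\frac{\pi}{3} \le \tau \le \frac{\pi}{3} \]

and hence $\ln f_{\{j,k\}}$ is strictly concave on the interval $[-\pi/3, \pi/3]$:

\[ \ln f_{\{j,k\}}(x) + \ln f_{\{j,k\}}(y) - 2 \ln f_{\{j,k\}}\left( \frac{x + y}{2} \right) \le -\frac{\delta^2}{8} (x - y)^2 \tag{8.3.4} \]

for all $x, y \in [-\pi/3, \pi/3]$.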
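The formulas of this subsection lend themselves to a quick numerical check. The sketch below verifies, for sample values of $\zeta$ chosen by us, that each factor of $|F(t)|$ equals $\sqrt{1 - \alpha + \alpha \cos x}$ with $\alpha = 2\zeta(1 - \zeta)$, and compares the closed form of the second derivative of $\ln f$ with a central finite difference. The function names are illustrative.

```python
import cmath
import math


def abs_factor(zeta, x):
    """|1 - zeta + zeta * e^{ix}|, the modulus of one factor of F(t)."""
    return abs(1 - zeta + zeta * cmath.exp(1j * x))


def f(alpha, x):
    """sqrt(1 - alpha + alpha * cos x); max() guards against tiny negative rounding."""
    return math.sqrt(max(0.0, 1 - alpha + alpha * math.cos(x)))


def d2_log_f(alpha, x):
    """Closed form of the second derivative of ln f with respect to x."""
    g = 1 - alpha + alpha * math.cos(x)
    return -alpha * (alpha + math.cos(x) - alpha * math.cos(x)) / (2 * g * g)


def d2_log_f_numeric(alpha, x, h=1e-4):
    """Central finite-difference approximation of the same second derivative."""
    return (math.log(f(alpha, x + h)) - 2 * math.log(f(alpha, x))
            + math.log(f(alpha, x - h))) / (h * h)
```

On the interval $[-\pi/3, \pi/3]$ the closed form is negative for every $0 < \alpha \le 1/2$, which is the concavity used later for $|F(t)|$ near the origin.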

In what follows, we fix a particular parameter $\epsilon > 0$. All implied constants in the "O" and "Ω" notation below may depend only on the parameters $\delta$ and $\epsilon$. We say that $n$ is sufficiently large if $n \ge \gamma(\delta, \epsilon)$ for some constant $\gamma(\delta, \epsilon) > 0$.

Our first goal is to show that only the points $t \in \Pi$ for which the inequality $\rho(\tau_j + \tau_k) \le \epsilon$ holds for an overwhelming majority of pairs $\{j,k\}$ contribute significantly to the integral of $|F(t)|$ on $\Pi$.

(8.4) Lemma. For $t \in \Pi$, $t = (\tau_1, \ldots, \tau_n)$, and $\epsilon > 0$ let us define a set $K(t, \epsilon) \subset \{1, \ldots, n\}$ consisting of the indices $k$ such that

\[ \rho(\tau_j + \tau_k) \le \epsilon \]

for more than $n/2$ distinct indices $j$. Let $\overline{K}(t, \epsilon) = \{1, \ldots, n\} \setminus K(t, \epsilon)$. Then

(1) \[ |F(t)| \le \exp\left\{ -\gamma(\delta)\, \epsilon^2 n \left| \overline{K}(t, \epsilon) \right| \right\} \quad \text{for some } \gamma(\delta) > 0; \]

(2) \[ \rho(\tau_{k_1} - \tau_{k_2}) \le 2\epsilon \quad \text{for all } k_1, k_2 \in K(t, \epsilon); \]

(3) Suppose that $|K(t, \epsilon)| > n/2$. Then

\[ \rho(\tau_{k_1} + \tau_{k_2}) \le 3\epsilon \quad \text{for all } k_1, k_2 \in K(t, \epsilon). \]



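Proof. For every $k \in \overline{K}(t, \epsilon)$ there are at least $(n-2)/2$ distinct $j \ne k$ for which

\[ \rho(\tau_j + \tau_k) > \epsilon \tag{8.4.1} \]

and so by (8.2.1) we have

\[ \cos(\tau_j + \tau_k) \le 1 - \frac{1}{5} \epsilon^2. \]

Since there are at least $|\overline{K}(t, \epsilon)| (n-2)/4$ pairs $\{j,k\}$ for which (8.4.1) holds, the proof of Part (1) follows from (8.3.2) and (8.3.3).

For any $k_1, k_2 \in K(t, \epsilon)$ there is $j \in \{1, \ldots, n\}$ such that

\[ \rho(\tau_j + \tau_{k_1}), \ \rho(\tau_j + \tau_{k_2}) \le \epsilon. \]

Therefore,

\[ \rho(\tau_{k_1} - \tau_{k_2}) = \rho(\tau_j + \tau_{k_1} - \tau_{k_2} - \tau_j) \le \rho(\tau_j + \tau_{k_1}) + \rho(-\tau_j - \tau_{k_2}) = \rho(\tau_j + \tau_{k_1}) + \rho(\tau_j + \tau_{k_2}) \le 2\epsilon \]

and Part (2) follows.

Let us choose a $k_1 \in K(t, \epsilon)$. Since $|K(t, \epsilon)| > n/2$, there is a $j \in K(t, \epsilon)$ such that

\[ \rho(\tau_j + \tau_{k_1}) \le \epsilon. \]

By Part (2), for any $k_2 \in K(t, \epsilon)$ we have

\[ \rho(\tau_{k_1} + \tau_{k_2}) = \rho(-\tau_j + \tau_{k_1} + \tau_j + \tau_{k_2}) \le \rho(\tau_j + \tau_{k_1}) + \rho(-\tau_j + \tau_{k_2}) \le 3\epsilon \]

and Part (3) follows. □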
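The set $K(t, \epsilon)$ and Parts (2) and (3) of the lemma can be illustrated on concrete points. The sketch below computes $K(t, \epsilon)$ directly from the definition; the sample points are our own and were chosen so that one lies near the origin (with two outlying coordinates) and one lies near a corner of $\Pi$.

```python
import math

TWO_PI = 2 * math.pi


def rho(x):
    """Distance from x to the nearest integer multiple of 2*pi."""
    r = abs(math.fmod(x, TWO_PI))
    return min(r, TWO_PI - r)


def K(t, eps):
    """Indices k such that rho(t[j] + t[k]) <= eps for more than n/2 indices j."""
    n = len(t)
    good = set()
    for k in range(n):
        count = sum(1 for j in range(n) if rho(t[j] + t[k]) <= eps)
        if count > n / 2:
            good.add(k)
    return good


eps = 0.1
# Seven coordinates near 0 and two outliers: the outliers fall outside K.
near_origin = [0.01 * i for i in range(-3, 4)] + [1.0, 2.0]
# All coordinates near +pi or -pi: every pairwise sum is close to a multiple of 2*pi.
near_corner = [math.pi - 0.01, -math.pi + 0.02, math.pi - 0.03, -math.pi + 0.01, math.pi]

k_origin = K(near_origin, eps)
k_corner = K(near_corner, eps)
```

In both examples $|K(t, \epsilon)| > n/2$, so the conclusions $\rho(\tau_{k_1} - \tau_{k_2}) \le 2\epsilon$ and $\rho(\tau_{k_1} + \tau_{k_2}) \le 3\epsilon$ apply to all pairs of indices in $K(t, \epsilon)$.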
(8.5) Corollary. For an $\epsilon > 0$ let us define a set $V(\epsilon) \subset \Pi$ consisting of the points $t \in \Pi$ such that

\[ \left| \overline{K}(t, \epsilon) \right| \le \ln^2 n, \]

where $\overline{K}(t, \epsilon)$ is defined in Lemma 8.4. Then

\[ \int_{\Pi \setminus V(\epsilon)} |F(t)|\, dt \le n^{-n} \int_{\Pi} |F(t)|\, dt \]

provided $n \ge \gamma(\delta, \epsilon)$ for some constant $\gamma(\delta, \epsilon) > 0$.

Proof. By Parts (1)-(3) of Theorem 7.1 we have

\[ \int_{\Pi} |F(t)|\, dt \ge \Omega\left( n^{-n/2} \right). \]

The proof now follows by Part (1) of Lemma 8.4. □

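Next, we show that only the points $t \in \Pi$ such that $\rho(\tau_j + \tau_k) \le \epsilon$ for all $1 \le j, k \le n$ contribute substantially to the integral of $|F(t)|$ on $\Pi$.

(8.6) Lemma. For $\epsilon > 0$ let us define a set $X(\epsilon) \subset \Pi$,

\[ X(\epsilon) = \left\{ t \in \Pi, \ t = (\tau_1, \ldots, \tau_n) : \ \rho(\tau_j + \tau_k) \le \epsilon \ \text{for all } j, k \right\}. \]

Then

\[ \int_{\Pi \setminus X(\epsilon)} |F(t)|\, dt \le \exp\left\{ -\gamma_1(\delta, \epsilon)\, n \right\} \int_{\Pi} |F(t)|\, dt \]

for all $n \ge \gamma_2(\delta, \epsilon)$ for some constants $\gamma_1(\delta, \epsilon), \gamma_2(\delta, \epsilon) > 0$.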

Proof. Let us consider the set $V(\epsilon/60) \subset \Pi$ and $n$ large enough so that the conclusion of Corollary 8.5 holds, that is, the integral of $|F(t)|$ over $\Pi \setminus V(\epsilon/60)$ is asymptotically negligible. For a set $A \subset \{1, \ldots, n\}$ such that

\[ \left| \overline{A} \right| \le \ln^2 n, \quad \text{where } \overline{A} = \{1, \ldots, n\} \setminus A, \]

let us define a set $P_A \subset \Pi$ (we call it a piece) consisting of the points $t$ such that

\[ \rho(\tau_j - \tau_k) \le \epsilon/30 \quad \text{and} \quad \rho(\tau_j + \tau_k) \le \epsilon/20 \quad \text{for all } j, k \in A. \]

If $n$ is large enough, by Lemma 8.4 for every $t \in V(\epsilon/60)$ we can choose $A = K(t, \epsilon/60)$, so we have

\[ V(\epsilon/60) \subset \bigcup_{A : \ |\overline{A}| \le \ln^2 n} P_A. \tag{8.6.1} \]

Our next goal is to show that the integral of $|F(t)|$ over $P_A \setminus X(\epsilon)$ is negligible compared to the integral of $|F(t)|$ over $P_A$.

Let us choose a point $t \in P_A \setminus X(\epsilon)$. Thus we have

\[ \rho(\tau_{i_0} + \tau_{j_0}) > \epsilon \quad \text{for some } i_0, j_0. \]

Let us choose any $k_0 \in A$. Then

\[ \rho(\tau_{i_0} + \tau_{j_0}) = \rho(\tau_{i_0} + \tau_{j_0} + \tau_{k_0} - \tau_{k_0}) \le \rho(\tau_{i_0} + \tau_{k_0}) + \rho(\tau_{j_0} - \tau_{k_0}). \]

Hence we have either

\[ \rho(\tau_{i_0} + \tau_{k_0}) > \epsilon/2 \quad \text{or} \quad \rho(\tau_{j_0} - \tau_{k_0}) > \epsilon/2. \]

In the first case, for every $k \in A$ we have

\[ \rho(\tau_{i_0} + \tau_{k_0}) = \rho(\tau_{i_0} + \tau_{k_0} + \tau_k - \tau_k) \le \rho(\tau_{i_0} + \tau_k) + \rho(\tau_{k_0} - \tau_k) \le \rho(\tau_{i_0} + \tau_k) + \epsilon/30, \]

from which

\[ \rho(\tau_{i_0} + \tau_k) > \epsilon/2 - \epsilon/30 = 7\epsilon/15. \]

In the second case, for every $k \in A$, we have

\[ \rho(\tau_{j_0} - \tau_{k_0}) = \rho(\tau_{j_0} - \tau_{k_0} + \tau_k - \tau_k) \le \rho(\tau_{j_0} + \tau_k) + \rho(-\tau_{k_0} - \tau_k) = \rho(\tau_{j_0} + \tau_k) + \rho(\tau_{k_0} + \tau_k) \le \rho(\tau_{j_0} + \tau_k) + \epsilon/20, \]

from which

\[ \rho(\tau_{j_0} + \tau_k) > \epsilon/2 - \epsilon/20 = 9\epsilon/20. \]

In either case, for any $t \in P_A \setminus X(\epsilon)$, $t = (\tau_1, \ldots, \tau_n)$, there exists an index $i \notin A$ such that

\[ \rho(\tau_i + \tau_k) > 0.45\epsilon \quad \text{for all } k \in A. \]

For $i \notin A$, we define

\[ Q_{A,i} = \left\{ t \in P_A : \ \rho(\tau_i + \tau_k) > 0.45\epsilon \ \text{for all } k \in A \right\}. \tag{8.6.2} \]

Hence

\[ P_A \setminus X(\epsilon) \subset \bigcup_{i \notin A} Q_{A,i}. \]

Given a point $t \in P_A$, we obtain another point in $P_A$ if we arbitrarily change the coordinate $\tau_i \in [-\pi, \pi]$ for $i \notin A$. We obtain a fiber $E \subset P_A$ if we fix all other coordinates and let $\tau_i \in [-\pi, \pi]$ vary. Geometrically, each fiber $E$ is an interval of length $2\pi$. Let us construct a set $I \subset E$ as follows. We choose an arbitrary $k_0 \in A$ and let $\tau_i$ vary in such a way that $\rho(\tau_{k_0} + \tau_i) \le 0.05\epsilon$. Geometrically, $I$ is an interval of length $0.1\epsilon$ or a union of two non-overlapping intervals of total length $0.1\epsilon$. Moreover,

\[ \rho(\tau_k + \tau_i) \le \rho(\tau_{k_0} + \tau_i) + \rho(\tau_k - \tau_{k_0}) \le 0.05\epsilon + 0.05\epsilon = 0.1\epsilon \]

for all $k \in A$ and all $t \in I$.

Using (8.3.2) and (8.3.3), we conclude from (8.6.2) that for any $t \in Q_{A,i} \cap E$ and for any $s \in I$ we have

\[ |F(t)| \le \exp\{-\Omega(n)\}\, |F(s)| \]

provided $n$ is large enough. Therefore,

\[ \int_{E \cap Q_{A,i}} |F(t)|\, dt \le \exp\{-\Omega(n)\} \int_E |F(t)|\, dt \]

for all sufficiently large $n$.

Integrating over all fibers $E$, we establish that

\[ \int_{Q_{A,i}} |F(t)|\, dt \le \exp\{-\Omega(n)\} \int_{P_A} |F(t)|\, dt \]

for all sufficiently large $n$. Since the number of different subsets $A \subset \{1, \ldots, n\}$ with $|\overline{A}| \le \ln^2 n$ in (8.6.1) does not exceed $\exp\left\{ O\left( \ln^3 n \right) \right\}$, the proof follows. □

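Next, we prove that only the points in the neighborhood of the origin or the corners of $\Pi$ contribute significantly to the integral of $|F(t)|$ over $\Pi$.

(8.7) Lemma. For $0 < \epsilon < 1$, let $X(\epsilon)$ be the set defined in Lemma 8.6. Let us define $Y(\epsilon), Z(\epsilon) \subset \Pi$ by

\[ Y(\epsilon) = \left\{ t \in \Pi : \ t = (\tau_1, \ldots, \tau_n), \ |\tau_i| \le \epsilon/2 \ \text{for } i = 1, \ldots, n \right\} \]

and

\[ Z(\epsilon) = \left\{ t \in \Pi : \ t = (\tau_1, \ldots, \tau_n), \ |\tau_i - \sigma_i \pi| \le \epsilon/2 \ \text{for some } \sigma_i = \pm 1 \text{ and all } i = 1, \ldots, n \right\}. \]

Then

\[ X(\epsilon) = Y(\epsilon) \cup Z(\epsilon) \quad \text{and} \quad Y(\epsilon) \cap Z(\epsilon) = \emptyset. \]

Moreover,

\[ \int_{Y(\epsilon)} |F(t)|\, dt = \int_{Z(\epsilon)} |F(t)|\, dt. \]

Proof. Let us pick a point $t \in X(\epsilon)$. Then for each $k$ we have $\rho(2\tau_k) \le \epsilon$ and hence either

\[ |\tau_k| \le \epsilon/2 \quad \text{or} \quad |\tau_k - \pi| \le \epsilon/2 \quad \text{or} \quad |\tau_k + \pi| \le \epsilon/2. \]

Since $\rho(\tau_k + \tau_j) \le \epsilon$ for all $k, j$ we conclude that if $|\tau_k| \le \epsilon/2$ for some $k$ then $|\tau_k| \le \epsilon/2$ for all $k$. Hence $X(\epsilon) \subset (Y(\epsilon) \cup Z(\epsilon))$. The inclusion $(Z(\epsilon) \cup Y(\epsilon)) \subset X(\epsilon)$ is obvious. Since $\epsilon < 1$, we have $Y(\epsilon) \cap Z(\epsilon) = \emptyset$.

The set $Z(\epsilon)$ is a union of $2^n$ pairwise disjoint corners, where each corner is determined by a choice of the interval $[-\pi, -\pi + \epsilon/2]$ or $[\pi - \epsilon/2, \pi]$ for each coordinate $\tau_i$. The transformation

\[ \tau_k \longmapsto \begin{cases} \tau_k + \pi & \text{if } \tau_k \in [-\pi, -\pi + \epsilon/2] \\ \tau_k - \pi & \text{if } \tau_k \in [\pi - \epsilon/2, \pi] \end{cases} \]

is a volume-preserving transformation which maps $Z(\epsilon)$ onto $Y(\epsilon)$ and does not change the value of $|F(t)|$. □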

Finally, we will use that $|F(t)|$ is strictly log-concave on the set $Y(1)$. For Euclidean space $V$ with the norm $\| \cdot \|$, a point $x \in V$ and a closed set $A \subset V$ we define the distance

\[ \operatorname{dist}(x, A) = \min_{y \in A} \|x - y\|. \]

We will need the following concentration inequality for strictly log-concave measures.

(8.8) Theorem. Let $V$ be Euclidean space with the norm $\| \cdot \|$, let $B \subset V$ be a convex body and let us consider a probability measure supported on $B$ with density $e^{-u}$, where $u : B \to \mathbb{R}$ is a function satisfying

\[ u(x) + u(y) - 2u\left( \frac{x + y}{2} \right) \ge c \|x - y\|^2 \quad \text{for all } x, y \in B \]

and some constant $c > 0$.

Let $A \subset B$ be a closed subset such that $\mathbf{P}(A) \ge 1/2$. Then, for $r \ge 0$, we have

\[ \mathbf{P}\left\{ x \in B : \ \operatorname{dist}(x, A) \ge r \right\} \le 2e^{-cr^2}. \]

Proof. See Section 2.2 of [Le01] and Theorem 8.1 and its proof in [Ba97]. □

(8.9) Lemma. Let $Y(1) \subset \Pi$ be the set defined by

\[ Y(1) = \left\{ t \in \Pi, \ t = (\tau_1, \ldots, \tau_n) : \ |\tau_i| \le \frac{1}{2} \ \text{for } i = 1, \ldots, n \right\}. \]

Let $U \subset \Pi$ be the set

\[ U = \left\{ t \in \Pi, \ t = (\tau_1, \ldots, \tau_n) : \ |\tau_i| \le \frac{\ln n}{\sqrt{n}} \ \text{for } i = 1, \ldots, n \right\}. \]

Then for any $\kappa > 0$ we have

\[ \int_{Y(1) \setminus U} |F(t)|\, dt \le n^{-\kappa} \int_{Y(1)} |F(t)|\, dt, \]

provided $n \ge \gamma(\delta, \kappa)$ is big enough.

Proof. Let us consider the probability measure on $Y(1)$ with density proportional to $|F(t)|$. Let us consider a map $M : \mathbb{R}^n \to \mathbb{R}^{\binom{n}{2}}$, where the coordinates of $\mathbb{R}^{\binom{n}{2}}$ are indexed by unordered pairs $\{j,k\}$ and

\[ M_{\{j,k\}}(\tau_1, \ldots, \tau_n) = \tau_j + \tau_k. \]

By Lemma 4.3,

\[ \|M(t)\|^2 \ge (n-2) \|t\|^2 \quad \text{for all } t \in \mathbb{R}^n. \]

Since from Section 8.3,

\[ \ln |F(t)| = \sum_{\{j,k\}} \ln f_{\{j,k\}}(\tau_j + \tau_k), \]

it follows by (8.3.4) that for any constant $a$ and

\[ u(t) = -\ln |F(t)| + a, \]

we have

\[ u(t_1) + u(t_2) - 2u\left( \frac{t_1 + t_2}{2} \right) \ge \gamma(\delta)\, n \|t_1 - t_2\|^2 \quad \text{for all } t_1, t_2 \in Y(1) \]

and some constant $\gamma(\delta) > 0$. We choose $a$ so that $e^{-u}$ is a probability density on $Y(1)$.

We apply Theorem 8.8 with $c = \gamma(\delta) n$. For $k = 1, \ldots, n$, let $A_k^- \subset Y(1)$ be the set of points with $\tau_k \le 0$ and let $A_k^+ \subset Y(1)$ be the set of points with $\tau_k \ge 0$. Since both $Y(1)$ and the probability measure are invariant under the symmetry $t \mapsto -t$, we have

\[ \mathbf{P}\left( A_k^- \right) = \mathbf{P}\left( A_k^+ \right) = \frac{1}{2} \quad \text{for } k = 1, \ldots, n. \]

Therefore, by Theorem 8.8, all but an $n^{-\kappa}$ fraction of all points in $Y(1)$ lie within a distance of $\ln n / \sqrt{n}$ from each of the sets $A_k^+$ and $A_k^-$, provided $n$ is large enough. □

(8.10) Proof of Theorem 8.1. For $0 < \epsilon < 1$ let us define the set $X(\epsilon)$ as in Lemma 8.6 and the sets $Y(\epsilon)$ and $Z(\epsilon)$ as in Lemma 8.7. In particular,

\[ U = Y\left( \frac{2 \ln n}{\sqrt{n}} \right) \quad \text{and} \quad W = Z\left( \frac{2 \ln n}{\sqrt{n}} \right). \]

By Lemma 8.6, for any $\kappa > 0$ we have

\[ \int_{\Pi \setminus X(1)} |F(t)|\, dt \le n^{-\kappa} \int_{\Pi} |F(t)|\, dt \]

for all sufficiently large $n$, so that the integral outside of $X(1)$ is asymptotically negligible.

By Lemma 8.7, $X(1) = Y(1) \cup Z(1)$ with $Y(1) \cap Z(1) = \emptyset$ and

\[ \int_{Y(1)} |F(t)|\, dt = \int_{Z(1)} |F(t)|\, dt \quad \text{and} \quad \int_U |F(t)|\, dt = \int_W |F(t)|\, dt. \tag{8.10.1} \]

By Lemma 8.9,

\[ \int_{Y(1) \setminus U} |F(t)|\, dt \le n^{-\kappa} \int_{Y(1)} |F(t)|\, dt \]

for all sufficiently large $n$, so that the integral over $Y(1) \setminus U$ is asymptotically negligible. By (8.10.1), the integral over $Z(1) \setminus W$ is asymptotically negligible. The proof now follows. □

9. Proof of Theorem 1.4 

By Corollary 3.3, we have the integral representation for the number $|G(D)|$ of graphs:

\[ |G(D)| = \frac{e^{H(z)}}{(2\pi)^n} \int_{\Pi} F(t)\, dt. \]

Let us define subsets $U, W \subset \Pi$ as in Theorem 8.1. Let us consider the transformation $W \to U$,

\[ \tau_k \longmapsto \begin{cases} \tau_k + \pi & \text{if } -\pi \le \tau_k \le -\pi + \frac{\ln n}{\sqrt{n}} \\ \tau_k - \pi & \text{if } \pi - \frac{\ln n}{\sqrt{n}} \le \tau_k \le \pi \end{cases} \quad \text{for } k = 1, \ldots, n. \]

As in the proof of Lemma 8.7, this is a measure-preserving transformation which maps $W$ onto $U$. Since $d_1 + \ldots + d_n$ is even, the transformation does not change the value of $F(t)$ (if $d_1 + \ldots + d_n$ is odd, the transformation changes the sign of $F(t)$). Hence

\[ \int_W F(t)\, dt = \int_U F(t)\, dt. \tag{9.1} \]
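The parity argument behind (9.1) can be tested numerically: shifting every coordinate by $\pm\pi$ multiplies $F(t)$ by $(-1)^{d_1 + \ldots + d_n}$. In the sketch below the degree sequences and the symmetric matrix of $\zeta$ values are arbitrary illustrative choices (not a maximum entropy matrix), since the identity holds for any such data; the function names are ours.

```python
import cmath
import math
from itertools import combinations


def F(t, d, zeta):
    """exp{-i sum_m d_m t_m} * prod_{j<k} (1 - zeta_jk + zeta_jk e^{i(t_j + t_k)})."""
    n = len(t)
    val = cmath.exp(-1j * sum(d[m] * t[m] for m in range(n)))
    for j, k in combinations(range(n), 2):
        z = zeta[j][k]
        val *= 1 - z + z * cmath.exp(1j * (t[j] + t[k]))
    return val


def shift_to_origin(t):
    """Move a point near a corner of [-pi, pi]^n to a point near the origin."""
    return [x - math.pi if x > 0 else x + math.pi for x in t]


# Illustrative symmetric values in (0, 1); diagonal entries are unused.
zeta = [[0.0 if j == k else 0.3 + 0.1 * ((j + k) % 3) for k in range(4)] for j in range(4)]
t = [3.0, -3.1, 3.05, -2.9]          # a point near a corner of the cube
s = shift_to_origin(t)

d_even = [2, 1, 2, 1]                # degree sum 6: F is unchanged by the shift
d_odd = [2, 1, 2, 2]                 # degree sum 7: F changes sign
```

Comparing `F(s, d_even, zeta)` with `F(t, d_even, zeta)` gives equal values up to rounding, while for `d_odd` the two values are negatives of each other.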
By Theorem 7.1, the integrals of $F(t)$ and $|F(t)|$ over $U$ have the same order of magnitude, that is,

\[ \int_U |F(t)|\, dt \le \gamma(\delta) \left| \int_U F(t)\, dt \right| \]

for some constant $\gamma(\delta) > 1$. Therefore, from Theorem 8.1, the integral outside of $U \cup W$ is asymptotically negligible, so that for any $\kappa > 0$ and all sufficiently large $n \ge \gamma(\delta, \kappa)$ we have

\[ \int_{\Pi \setminus (U \cup W)} |F(t)|\, dt \le n^{-\kappa} \left| \int_U F(t)\, dt \right|. \]

The proof now follows by Parts (2) and (4) of Theorem 7.1, identity (9.1) and the formula

\[ \Xi = \int_{\mathbb{R}^n} e^{-q(t)}\, dt = \frac{(2\pi)^{n/2}}{\sqrt{\det Q}}. \]

□



10. Proof of Theorem 1.6 



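The proof is very similar to that of Theorem 3 of [Ba10], which deals with a similar situation in the case of bipartite graphs. All implicit constants in the "O" and "Ω" notation below may depend only on the parameter $\delta > 0$.

For pairs $1 \le j \ne k \le n$, let $x_{\{j,k\}}$ be independent Bernoulli random variables such that

\[ \mathbf{P}\left\{ x_{\{j,k\}} = 1 \right\} = \zeta_{\{j,k\}} \quad \text{and} \quad \mathbf{P}\left\{ x_{\{j,k\}} = 0 \right\} = 1 - \zeta_{\{j,k\}}. \]

As is implied by Theorem 5 of [BH10], the probability mass function of the random vector $X = \left( x_{\{j,k\}} \right)$ is constant on the integer points of $\mathcal{P}(D)$ and is equal to $e^{-H(z)}$ at each $G \in G(D)$.
Let us define

\[ \sigma_S(X) = \sum_{\{j,k\} \in S} x_{\{j,k\}}. \]

Then

\[ \mathbf{P}\left\{ G \in G(D) : \ \sigma_S(G) \le (1 - \epsilon) \sigma_S(z) \right\} = \frac{\mathbf{P}\left\{ X : \ \sigma_S(X) \le (1 - \epsilon) \sigma_S(z) \ \text{and} \ X \in G(D) \right\}}{\mathbf{P}\left\{ X : \ X \in G(D) \right\}} \]

and, similarly,

\[ \mathbf{P}\left\{ G \in G(D) : \ \sigma_S(G) \ge (1 + \epsilon) \sigma_S(z) \right\} = \frac{\mathbf{P}\left\{ X : \ \sigma_S(X) \ge (1 + \epsilon) \sigma_S(z) \ \text{and} \ X \in G(D) \right\}}{\mathbf{P}\left\{ X : \ X \in G(D) \right\}}. \]

Applying Theorem 1.4 and Parts (1) and (2) of Theorem 7.1, we get

\[ \mathbf{P}\left\{ X : \ X \in G(D) \right\} = e^{-H(z)} |G(D)| \ge n^{-O(n)}. \]

On the other hand, standard large deviation inequalities for sums of bounded independent random variables (see, for example, Corollary 5.3 of [McD89]) imply that

\[ \mathbf{P}\left\{ X : \ \sigma_S(X) \ge (1 + \epsilon) \sigma_S(z) \right\} \le \exp\left\{ -\Omega\left( n \ln^2 n \right) \right\} \]

and, similarly,

\[ \mathbf{P}\left\{ X : \ \sigma_S(X) \le (1 - \epsilon) \sigma_S(z) \right\} \le \exp\left\{ -\Omega\left( n \ln^2 n \right) \right\} \]

and the proof follows. □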

11. Counting bipartite graphs 

Here we list some modifications needed to establish the asymptotic formula 
(2.5.4). We adhere to the notation of Section 2.5. 

As in Corollary 3.3, we represent the number of 0-1 matrices with row sums $R = (r_1, \ldots, r_m)$ and column sums $C = (c_1, \ldots, c_n)$ as an integral. Let us define

\[ F(s, t) = \exp\left\{ -i \sum_{j=1}^m r_j \sigma_j - i \sum_{k=1}^n c_k \tau_k \right\} \prod_{\substack{1 \le j \le m \\ 1 \le k \le n}} \left( 1 - \zeta_{jk} + \zeta_{jk} e^{i(\sigma_j + \tau_k)} \right) \quad \text{for } (s, t) = (\sigma_1, \ldots, \sigma_m; \tau_1, \ldots, \tau_n) \]

and let $\Pi \subset \mathbb{R}^{m+n}$ be the parallelepiped

\[ \Pi = \left\{ (\sigma_1, \ldots, \sigma_m; \tau_1, \ldots, \tau_n) : \ -\pi \le \sigma_j, \tau_k \le \pi \ \text{for } j = 1, \ldots, m; \ k = 1, \ldots, n \right\}. \]

Let $\Pi_0 \subset \Pi$ be the facet of $\Pi$ defined by the equation $\tau_n = 0$.

Since the constraints are not independent (the sum of all row sums is equal to the sum of all column sums), we can drop one of the constraints and represent the desired number $|R, C|$ as an integral over $\Pi_0$,

\[ |R, C| = \frac{e^{H(z)}}{(2\pi)^{m+n-1}} \int_{\Pi_0} F(s, t)\, ds\, dt, \]

cf. Section 2 of [BH09a].
Let $U \subset \Pi$ be the neighborhood of the origin,



f, .... Inn 



71 



for j = 1, . . . , m; fc = 1, . . . , n j 



and let $U_0$ be the intersection of $U$ with the hyperplane $\tau_n = 0$. We prove that the
integral

$$\int_{\Pi_0 \setminus U_0} |F(s,t)|\, ds\, dt$$

is asymptotically negligible relative to the integral

(11.1)   $$\int_{U_0} |F(s,t)|\, ds\, dt.$$

The proof is a modification of that of Theorem 8.1 (note that here we do not have
another set $W \subset \Pi$ contributing large values of $|F(s,t)|$) and is very similar to that
of Theorem 7.1 of [BH09a]. A different line of proof can be inferred from [BH09b].

Our next goal is to evaluate asymptotically as $m, n \to +\infty$ the integral

(11.2)   $$\int_{U_0} F(s,t)\, ds\, dt.$$

In particular, we need to show that (11.2) and (11.1) have about the same order of
magnitude. From (3.2), we can write the expansion

$$F(s,t) = \exp\big\{-q(s,t) + i f(s,t) + h(s,t)\big\}\big(1 + o(1)\big) \quad \text{for } (s,t) \in U,$$

as $m, n \to +\infty$, where $q$, $f$ and $h$ are as defined by formulas (2.5.1) and (2.5.3)
respectively.

We note that $q(s,t)$ is not strictly positive definite, since its kernel is spanned
by the vector $u \in \mathbb{R}^{m+n}$ defined by (2.5.2). However, the restriction of $q$ onto
any hyperplane $L \subset \mathbb{R}^{m+n}$ which does not contain $u$ is strictly positive definite and
allows us to define the Gaussian probability measure in $L$ with density proportional
to $e^{-q}$. It is easy to prove (see Lemma 3.1 of [BH09a]) that the expectation of any
polynomial in the sums $\sigma_j + \tau_k$ does not depend on the choice of $L$. To evaluate
(11.2), we need to show that asymptotically

$$\mathbf{E} \exp\{i f + h\} = \exp\Big\{-\frac{1}{2}\mathbf{E} f^2 + \mathbf{E} h\Big\}\big(1 + o(1)\big),$$

if we choose the hyperplane $L$ defined by the equation $\tau_n = 0$. However, since the
expectation on the left hand side does not depend on the choice of the hyperplane
$L$, we can choose $L$ in such a way that

(11.3)   $$\begin{aligned} |\mathbf{E}\tau_j\tau_k|,\ |\mathbf{E}\sigma_j\sigma_k| &= O\Big(\frac{1}{mn}\Big) \quad \text{provided } j \ne k, \\ |\mathbf{E}\sigma_j\tau_k| &= O\Big(\frac{1}{mn}\Big) \quad \text{for all } j, k \text{ and} \\ \mathbf{E}\sigma_j^2,\ \mathbf{E}\tau_k^2 &= O\Big(\frac{1}{m+n}\Big) \quad \text{for all } j, k. \end{aligned}$$

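As is shown in [BH09a] (see Theorem 3.2 there), to ensure (11.3), one has to choose
$L$ defined by the equation

$$\sum_{j=1}^{m} \alpha_j\sigma_j = \sum_{k=1}^{n} \beta_k\tau_k, \quad \text{where}$$

$$\alpha_j = \sum_{k=1}^{n} \big(\zeta_{jk} - \zeta_{jk}^2\big) \quad \text{and} \quad \beta_k = \sum_{j=1}^{m} \big(\zeta_{jk} - \zeta_{jk}^2\big).$$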

The proof then proceeds as in Theorem 1.4. 
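
The choice of the hyperplane can be checked on a small example. The sketch below
assumes that the quadratic form of (2.5.1) is
q(s,t) = (1/2) sum_{j,k} (zeta_jk - zeta_jk^2)(sigma_j + tau_k)^2 and that the vector u of
(2.5.2) is (1, ..., 1; -1, ..., -1); neither formula appears in this section, so both are
assumptions, and all function names are hypothetical. The code confirms that q
vanishes on u and that u does not lie in the hyperplane L.

```python
def weights(zeta):
    """w_jk = zeta_jk - zeta_jk^2 for an m x n matrix given as nested lists."""
    return [[z - z * z for z in row] for row in zeta]


def q(zeta, s, t):
    """Assumed form of (2.5.1):
    q(s, t) = 1/2 * sum_{j,k} (zeta_jk - zeta_jk^2) * (sigma_j + tau_k)^2."""
    w = weights(zeta)
    return 0.5 * sum(w[j][k] * (s[j] + t[k]) ** 2
                     for j in range(len(s)) for k in range(len(t)))


def hyperplane_coefficients(zeta):
    """alpha_j are the row sums and beta_k the column sums of the weights."""
    w = weights(zeta)
    alpha = [sum(row) for row in w]
    beta = [sum(w[j][k] for j in range(len(w))) for k in range(len(w[0]))]
    return alpha, beta


def hyperplane_defect(zeta, s, t):
    """sum_j alpha_j sigma_j - sum_k beta_k tau_k; this is zero exactly on L."""
    alpha, beta = hyperplane_coefficients(zeta)
    return (sum(a * x for a, x in zip(alpha, s))
            - sum(b * y for b, y in zip(beta, t)))


if __name__ == "__main__":
    zeta = [[0.5, 0.25, 0.5], [0.75, 0.5, 0.25]]   # a 2 x 3 example
    u = ([1, 1], [-1, -1, -1])                      # assumed kernel vector
    print(q(zeta, *u))                  # q vanishes on u
    print(hyperplane_defect(zeta, *u))  # non-zero, so u is not in L
```

Since all weights zeta_jk - zeta_jk^2 are positive when 0 < zeta_jk < 1, the defect at u is
twice the sum of the weights and hence never zero, so the restriction of q to L is
strictly positive definite.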

12. Proof of Theorem 2.1 

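In what follows, it is convenient to define the polytope $\mathcal{P}(D) \subset \mathbb{R}^{\binom{n}{2}}$ for positive,
not necessarily integer, sequences $D = (d_1, \ldots, d_n)$. Recall that $\mathcal{P}(D)$ consists of
the vectors $(\xi_{\{j,k\}})$ for $1 \le j \ne k \le n$ such that

$$\sum_{j:\ j \ne k} \xi_{\{j,k\}} = d_k \quad \text{for } k = 1, \ldots, n$$

and

$$0 \le \xi_{\{j,k\}} \le 1 \quad \text{for } 1 \le j \ne k \le n.$$

We say that $\mathcal{P}(D)$ has a non-empty interior if there is a point $y \in \mathcal{P}(D)$,
$y = (\eta_{\{j,k\}})$, such that

$$0 < \eta_{\{j,k\}} < 1 \quad \text{for all } 1 \le j \ne k \le n.$$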

The following two lemmas are probably known in greater generality, but since 
we are unable to provide a precise reference, we prove only the parts we need to 
obtain Theorem 2.1. 

(12.1) Lemma. Let $D = (d_1, \ldots, d_n)$ be a sequence of positive rational numbers
such that

$$d_1 \ge \ldots \ge d_n$$

and the Erdős-Gallai conditions

$$\sum_{i=1}^{k} d_i \le k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\} \quad \text{for } k = 1, \ldots, n$$

are satisfied. Then the polytope $\mathcal{P}(D)$ is non-empty.

Proof. Let $q$ be a positive integer such that $qd_i$ are even integers for $i = 1, \ldots, n$.
Clearly, $\mathcal{P}(D)$ is non-empty if and only if the dilated polytope $q\mathcal{P}(D)$ is non-empty.
By Theorem 6.3.5 of [BR91] there exists an $n \times n$ symmetric non-negative integer
matrix with zero trace, row/column sums $qd_1, \ldots, qd_n$ and the entries not exceeding
$q$ if and only if

$$\sum_{i=1}^{k} qd_i \le qk(k-1) + \sum_{i=k+1}^{n} \min\{qk, qd_i\} \quad \text{for } k = 1, \ldots, n.$$

Hence if the Erdős-Gallai conditions are satisfied, the polytope $q\mathcal{P}(D)$ is non-empty,
and hence the polytope $\mathcal{P}(D)$ is non-empty. □
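
The dilation step in the proof can be illustrated with exact rational arithmetic. The
following sketch checks the Erdős-Gallai conditions for a rational sequence and the
dilated conditions for the scaled sequence, which have to agree. The function names
and the parameter cap (which plays the role of q) are hypothetical.

```python
from fractions import Fraction


def erdos_gallai(d, cap=1, strict=False):
    """Check sum_{i<=k} d_i <= cap*k*(k-1) + sum_{i>k} min(cap*k, d_i)
    for k = 1..n, after sorting d in non-increasing order.

    cap=1 gives the Erdos-Gallai conditions; cap=q gives the dilated
    conditions for the sequence q*D. With strict=True equality fails too.
    """
    d = sorted(d, reverse=True)
    n = len(d)
    for k in range(1, n + 1):
        lhs = sum(d[:k])
        rhs = cap * k * (k - 1) + sum(min(cap * k, x) for x in d[k:])
        if lhs > rhs or (strict and lhs == rhs):
            return False
    return True


def dilation_factor(d):
    """Smallest positive integer q such that q*d_i is an even integer for
    every i. The entries of d must be integers or Fractions."""
    q = 1
    while any((q * x) % 2 != 0 for x in d):
        q += 1
    return q


if __name__ == "__main__":
    D = [Fraction(3, 2), Fraction(3, 2), 1, 1]
    q = dilation_factor(D)
    print(q, erdos_gallai(D), erdos_gallai([q * x for x in D], cap=q))
```

Multiplying every term of the k-th inequality by q turns the original conditions
into the dilated ones, so the two checks return the same answer for any rational D.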

Next, we prove a sufficient condition for the polytope $\mathcal{P}(D)$ to have a non-empty
interior.

(12.2) Lemma. Let $D = (d_1, \ldots, d_n)$ be a sequence of positive integers such that

$$d_1 \ge \ldots \ge d_n$$

and the strict Erdős-Gallai conditions

$$\sum_{i=1}^{k} d_i < k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\} \quad \text{for } k = 1, \ldots, n$$

are satisfied. Then $\mathcal{P}(D)$ has a non-empty interior.

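Proof. For a sufficiently small rational $\epsilon > 0$, let us define

$$d_i(\epsilon) = \frac{d_i - (n-1)\epsilon}{1 - 2\epsilon} \quad \text{for } i = 1, \ldots, n.$$

Clearly, $d_i(0) = d_i$ and

$$d_1(\epsilon) \ge \ldots \ge d_n(\epsilon).$$

For all sufficiently small $\epsilon > 0$ we have $d_i(\epsilon) > 0$ for $i = 1, \ldots, n$ and the
Erdős-Gallai conditions of Lemma 12.1 are satisfied for $d_i(\epsilon)$. Therefore, by Lemma
12.1, the polytope $\mathcal{P}_\epsilon = \mathcal{P}(D_\epsilon)$ for $D_\epsilon = (d_1(\epsilon), \ldots, d_n(\epsilon))$ is non-empty. Let
$x \in \mathcal{P}_\epsilon$, $x = (\xi_{\{j,k\}})$, be a point. Then the point $y = (\eta_{\{j,k\}})$ defined by

$$\eta_{\{j,k\}} = (1 - 2\epsilon)\xi_{\{j,k\}} + \epsilon \quad \text{for all } 1 \le j \ne k \le n$$

is the desired interior point in $\mathcal{P}(D)$. □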
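
The construction in the proof is a one-line affine map, and it can be verified
exactly on a small example. In the sketch below the sequence D = (3, 2, 2, 2, 1), the
value eps = 1/10 and the boundary point xi are hypothetical choices made for
illustration, as are the function names.

```python
from fractions import Fraction


def shrink_degrees(d, eps):
    """d_i(eps) = (d_i - (n-1)*eps) / (1 - 2*eps)."""
    n = len(d)
    return [(x - (n - 1) * eps) / (1 - 2 * eps) for x in d]


def push_inside(xi, eps):
    """eta_{jk} = (1 - 2*eps)*xi_{jk} + eps off the diagonal; the diagonal
    stays zero. xi is a symmetric matrix given as nested lists."""
    n = len(xi)
    return [[(1 - 2 * eps) * xi[j][k] + eps if j != k else Fraction(0)
             for k in range(n)] for j in range(n)]


def row_sums(mat):
    return [sum(row) for row in mat]


if __name__ == "__main__":
    eps = Fraction(1, 10)
    a, b, c = Fraction(5, 12), Fraction(1, 6), Fraction(1, 4)
    # A point of P(D_eps) that has entries equal to 1, i.e. a boundary point.
    xi = [[0, 1, 1, 1, c],
          [1, 0, a, a, b],
          [1, a, 0, a, b],
          [1, a, a, 0, b],
          [c, b, b, b, 0]]
    print(row_sums(xi) == shrink_degrees([3, 2, 2, 2, 1], eps))
    print(row_sums(push_inside(xi, eps)))
```

Each row of eta sums to (1 - 2 eps) d_k(eps) + (n - 1) eps = d_k, and every off-diagonal
entry lies between eps and 1 - eps, which is what the proof requires.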

Next, we prove that our conditions on the minimum and maximum degree ensure
that $\mathcal{P}(D)$ has a non-empty interior.

(12.3) Lemma. Let us fix real numbers $0 < \alpha < \beta < 1$ such that

$$\beta < 2\sqrt{\alpha} - \alpha, \quad \text{or, equivalently,} \quad (\alpha + \beta)^2 < 4\alpha,$$

and let $D = (d_1, \ldots, d_n)$ be an integer sequence such that

$$\alpha < \frac{d_i}{n-1} < \beta \quad \text{for } i = 1, \ldots, n.$$

Then for

$$n > \max\Big\{\frac{\beta}{\alpha(1-\beta)},\ \frac{4(\beta - \alpha)}{4\alpha - (\alpha+\beta)^2}\Big\} + 1$$

the polytope $\mathcal{P}(D)$ has a non-empty interior.
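
The statement can be tested numerically through the strict Erdős-Gallai conditions
used in the proof. The sketch below assumes that the threshold reads
max{ beta/(alpha(1-beta)), 4(beta-alpha)/(4 alpha-(alpha+beta)^2) } + 1, a form
reconstructed from the proof, and it draws random integer sequences in the allowed
range. The function names, the sample values alpha = 0.3, beta = 0.6, n = 20 and the
random seed are hypothetical.

```python
import math
import random


def size_threshold(alpha, beta):
    """Assumed threshold for n:
    max{beta/(alpha(1-beta)), 4(beta-alpha)/(4 alpha-(alpha+beta)^2)} + 1."""
    if not (0 < alpha < beta < 1 and (alpha + beta) ** 2 < 4 * alpha):
        raise ValueError("need 0 < alpha < beta < 1 and (alpha+beta)^2 < 4 alpha")
    return max(beta / (alpha * (1 - beta)),
               4 * (beta - alpha) / (4 * alpha - (alpha + beta) ** 2)) + 1


def strict_erdos_gallai(d):
    """sum_{i<=k} d_i < k(k-1) + sum_{i>k} min(k, d_i) for k = 1..n."""
    d = sorted(d, reverse=True)
    n = len(d)
    return all(sum(d[:k]) < k * (k - 1) + sum(min(k, x) for x in d[k:])
               for k in range(1, n + 1))


def random_sequence(n, alpha, beta, rng):
    """Integers d_i with alpha < d_i/(n-1) < beta, drawn uniformly."""
    lo = math.floor(alpha * (n - 1)) + 1
    hi = math.ceil(beta * (n - 1)) - 1
    return [rng.randint(lo, hi) for _ in range(n)]


if __name__ == "__main__":
    alpha, beta, n = 0.3, 0.6, 20
    rng = random.Random(0)
    print(n > size_threshold(alpha, beta))
    print(all(strict_erdos_gallai(random_sequence(n, alpha, beta, rng))
              for _ in range(200)))
```

A random search of this kind cannot replace the proof, but it is a quick sanity check
on the reconstructed threshold.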
Proof. Without loss of generality, we assume that

$$d_1 \ge \ldots \ge d_n.$$

Let us show that the strict Erdős-Gallai conditions

$$\sum_{i=1}^{k} d_i < k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\} \quad \text{for } k = 1, \ldots, n$$

are satisfied.

We consider three different cases for $k$.

Suppose that $k \le \alpha(n-1)$. Then

$$\sum_{i=1}^{k} d_i < k\beta(n-1) \quad \text{and} \quad \min\{k, d_i\} = k \ \text{ for all } i.$$

Therefore,

$$k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\} = k(k-1) + k(n-k) = k(n-1)$$

and the strict Erdős-Gallai conditions are satisfied, since $\beta < 1$.
Suppose that $k > \beta(n-1)$. Then

$$\sum_{i=1}^{k} d_i < k\beta(n-1) \quad \text{and} \quad \min\{k, d_i\} = d_i > \alpha(n-1).$$

If $k \ge \beta(n-1) + 1$ then $k(k-1) \ge k\beta(n-1)$ and the strict Erdős-Gallai conditions
are satisfied. If $\beta(n-1) < k < \beta(n-1) + 1$ then

$$\begin{aligned} k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\} &\ge \beta(n-1)(k-1) + \alpha(n-1)(n-k) \\ &= \beta k(n-1) + (n-1)\big(\alpha(n-k) - \beta\big) \\ &\ge \beta k(n-1) + (n-1)\big(\alpha(n-1)(1-\beta) - \beta\big) \end{aligned}$$

and the strict Erdős-Gallai conditions are satisfied provided

$$n \ge \frac{\beta}{\alpha(1-\beta)} + 1.$$
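Finally, suppose that $k = \gamma(n-1)$ for some $\alpha \le \gamma \le \beta$. Then

$$\sum_{i=1}^{k} d_i < k\beta(n-1) = \gamma\beta(n-1)^2 \quad \text{and} \quad \min\{k, d_i\} \ge \alpha(n-1).$$

Therefore,

$$k(k-1) + \sum_{i=k+1}^{n} \min\{k, d_i\} \ge \gamma^2(n-1)^2 - (\gamma - \alpha)(n-1) + (1-\gamma)\alpha(n-1)^2.$$

The minimum value of the function

$$\gamma \longmapsto \gamma^2 + (1-\gamma)\alpha - \gamma\beta$$

is attained at $\gamma = (\alpha + \beta)/2$ and equal to

$$\frac{4\alpha - (\alpha+\beta)^2}{4}.$$

Therefore, the strict Erdős-Gallai conditions are satisfied, provided

$$n \ge \frac{4(\beta - \alpha)}{4\alpha - (\alpha+\beta)^2} + 1.$$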

The proof now follows by Lemma 12.2. □

(12.4) Proof of Theorem 2.1. By Lemma 12.3, the polytope $\mathcal{P}(D)$ contains a
point $y = (\eta_{\{j,k\}})$ such that $0 < \eta_{\{j,k\}} < 1$ for all $j, k$. First, we show that the
maximum entropy matrix $z$ lies in the interior of $\mathcal{P}(D)$, that is, $0 < \zeta_{\{j,k\}} < 1$ for
all $j, k$.

We have

$$\frac{\partial}{\partial \xi_{\{j,k\}}} H(x) = \ln\frac{1 - \xi_{\{j,k\}}}{\xi_{\{j,k\}}}.$$

We note that the value of the derivative is $+\infty$ at $\xi_{\{j,k\}} = 0$ (we consider the right
derivative there), is $-\infty$ at $\xi_{\{j,k\}} = 1$ (we consider the left derivative there) and is
finite for $0 < \xi_{\{j,k\}} < 1$. Therefore, if for the maximum point $z$ and some $j \ne k$ we
have $\zeta_{\{j,k\}} \in \{0, 1\}$ then for $\tilde{z} = (1-\epsilon)z + \epsilon y$ for a sufficiently small $\epsilon > 0$, we have
$\tilde{z} \in \mathcal{P}(D)$ and $H(\tilde{z}) > H(z)$, which is a contradiction.

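Since the maximum value of $H$ is attained at an interior point of $\mathcal{P}(D)$, the
gradient of $H$ at the maximum point is orthogonal to the affine span of $\mathcal{P}(D)$, that
is,

$$\ln\frac{1 - \zeta_{\{j,k\}}}{\zeta_{\{j,k\}}} = \lambda_j + \lambda_k,$$

or, equivalently,

(12.4.1)   $$\zeta_{\{j,k\}} = \frac{1}{1 + e^{\lambda_j + \lambda_k}} \quad \text{for all } 1 \le j \ne k \le n$$

for some real $\lambda_1, \ldots, \lambda_n$. Without loss of generality, we assume that

(12.4.2)   $$\lambda_1 \le \lambda_j \le \lambda_n \quad \text{for all } j.$$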
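
Formula (12.4.1) reduces the computation of the maximum entropy matrix to finding n
real numbers. The following minimal sketch solves the row sum equations by cyclic
coordinate updates with bisection. This is one simple method chosen for illustration
and not necessarily the procedure intended by the authors; the function names, the
number of sweeps and the bracket are hypothetical.

```python
import math


def row_sum(lam, j):
    """sum over k != j of 1 / (1 + exp(lam_j + lam_k))."""
    return sum(1.0 / (1.0 + math.exp(lam[j] + lam[k]))
               for k in range(len(lam)) if k != j)


def solve_lambdas(d, sweeps=200, bracket=40.0):
    """Cyclic coordinate updates: for each j, bisect on lam_j so that the
    j-th row sum equals d_j (the row sum is decreasing in lam_j).
    Assumes the solution has all |lam_j| well inside the bracket."""
    n = len(d)
    lam = [0.0] * n
    for _ in range(sweeps):
        for j in range(n):
            lo, hi = -bracket, bracket
            for _ in range(60):
                lam[j] = 0.5 * (lo + hi)
                if row_sum(lam, j) > d[j]:
                    lo = lam[j]   # row sum too large: lam_j must increase
                else:
                    hi = lam[j]
    return lam


def max_entropy_matrix(lam):
    """zeta_{jk} = 1 / (1 + exp(lam_j + lam_k)) with zero diagonal."""
    n = len(lam)
    return [[0.0 if j == k else 1.0 / (1.0 + math.exp(lam[j] + lam[k]))
             for k in range(n)] for j in range(n)]


if __name__ == "__main__":
    d = [3, 2, 2, 2, 1]   # satisfies the strict Erdos-Gallai conditions
    lam = solve_lambdas(d)
    z = max_entropy_matrix(lam)
    print(lam)
    print([sum(row) for row in z])
```

Each coordinate update is an exact minimization of a convex function of lam_1, ..., lam_n
along one axis, which is why the sweeps converge when the maximum entropy matrix lies
in the interior of the polytope. Larger degrees correspond to smaller values of lam_j.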
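From the choice of $\epsilon$ in Theorem 2.1, it follows that

(12.4.3)   $$\epsilon < \alpha \quad \text{and} \quad \beta < 2\sqrt{\alpha - \epsilon} - \alpha.$$

Our next goal is to show that

(12.4.4)   $$\lambda_n \le 2\ln\frac{1}{\epsilon}.$$

Aiming for a contradiction, suppose that

$$\lambda_n > 2\ln\frac{1}{\epsilon}.$$

Then, necessarily,

$$\lambda_1 < \ln\epsilon$$

since otherwise by (12.4.1) and (12.4.2) we have

$$\zeta_{\{j,n\}} = \frac{1}{1 + e^{\lambda_j + \lambda_n}} \le \frac{1}{1 + e^{\lambda_1 + \lambda_n}} < \epsilon$$

and

$$d_n = \sum_{j:\ j \ne n} \zeta_{\{j,n\}} < \epsilon(n-1),$$

which by (12.4.3) contradicts the lower bound for $d_i$.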

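Since $\lambda_1 < \ln\epsilon$ and $\lambda_n > -2\ln\epsilon$, we deduce from (12.4.1) that for $1 < j < n$ we
have

(12.4.5)   $$\begin{aligned} \zeta_{\{j,n\}} &= \frac{1}{1 + e^{\lambda_j + \lambda_n}} < \frac{1}{1 + \epsilon^{-2}} < \epsilon && \text{provided } \lambda_j \ge 0 \quad \text{and} \\ \zeta_{\{1,j\}} &= \frac{1}{1 + e^{\lambda_1 + \lambda_j}} > \frac{1}{1 + \epsilon} > 1 - \epsilon && \text{provided } \lambda_j \le 0. \end{aligned}$$

Denoting

$$\tau = \zeta_{\{1,n\}} = \frac{1}{1 + e^{\lambda_1 + \lambda_n}},$$

by (12.4.1) and (12.4.2) we obtain that for $1 < j < n$ we have

(12.4.6)   $$\begin{aligned} \zeta_{\{1,j\}} &= \frac{1}{1 + e^{\lambda_1 + \lambda_j}} \ge \frac{1}{1 + e^{\lambda_1 + \lambda_n}} = \tau \quad \text{and} \\ \zeta_{\{j,n\}} &= \frac{1}{1 + e^{\lambda_j + \lambda_n}} \le \frac{1}{1 + e^{\lambda_1 + \lambda_n}} = \tau. \end{aligned}$$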

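Let

$$\big|\{1 < j < n:\ \lambda_j \le 0\}\big| = \gamma(n-1) \quad \text{for some } 0 \le \gamma \le 1.$$

Combining (12.4.5) and (12.4.6), we obtain

$$\begin{aligned} \beta(n-1) > d_1 = \sum_{j:\ j \ne 1} \zeta_{\{1,j\}} &= \sum_{j \ne 1:\ \lambda_j \le 0} \zeta_{\{1,j\}} + \sum_{j \ne 1:\ \lambda_j > 0} \zeta_{\{1,j\}} \\ &\ge (1-\epsilon)\gamma(n-1) + (n-1)(1-\gamma)\tau \end{aligned}$$

and

$$\begin{aligned} \alpha(n-1) < d_n = \sum_{j:\ j \ne n} \zeta_{\{j,n\}} &= \sum_{j \ne n:\ \lambda_j > 0} \zeta_{\{j,n\}} + \sum_{j \ne n:\ \lambda_j \le 0} \zeta_{\{j,n\}} \\ &\le \epsilon(1-\gamma)(n-1) + (n-1)\gamma\tau. \end{aligned}$$

Consequently,

$$\beta > (1-\epsilon)\gamma + (1-\gamma)\tau \quad \text{and} \quad \alpha < \epsilon(1-\gamma) + \gamma\tau.$$

Therefore,

$$\beta + \epsilon > \gamma + (1-\gamma)\tau \quad \text{and} \quad \alpha - \epsilon < \gamma\tau.$$

Since the function $x \longmapsto 2\sqrt{x} - x$ is increasing for $0 < x < 1$, from (12.4.3) it follows that

$$\gamma + (1-\gamma)\tau < 2\sqrt{\gamma\tau} - \gamma\tau,$$

or, equivalently,

$$\frac{\gamma + \tau}{2} < \sqrt{\gamma\tau},$$

which contradicts the inequality between the arithmetic and geometric means.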
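
The two facts behind this contradiction are elementary, and a small grid check makes
them concrete. The sketch below assumes the function in question is
phi(x) = 2 sqrt(x) - x; the names phi, gap and the grid are hypothetical. It confirms
that phi is increasing on [0, 1] and that gamma + (1 - gamma) tau is never smaller than
2 sqrt(gamma tau) - gamma tau, up to rounding.

```python
import math


def phi(x):
    """phi(x) = 2*sqrt(x) - x; phi'(x) = 1/sqrt(x) - 1 > 0 for 0 < x < 1."""
    return 2.0 * math.sqrt(x) - x


def gap(gamma, tau):
    """gamma + (1 - gamma)*tau - phi(gamma*tau), which simplifies to
    gamma + tau - 2*sqrt(gamma*tau) = (sqrt(gamma) - sqrt(tau))**2 >= 0."""
    return gamma + (1.0 - gamma) * tau - phi(gamma * tau)


GRID = [i / 50 for i in range(51)]

# Smallest value of the gap over the grid; it is zero (up to rounding)
# exactly when gamma == tau.
MIN_GAP = min(gap(g, t) for g in GRID for t in GRID)

if __name__ == "__main__":
    print(all(phi(a) < phi(b) for a, b in zip(GRID, GRID[1:])))
    print(MIN_GAP)
```

Since the gap is a perfect square, no choice of gamma and tau in [0, 1] can make the
strict inequality of the proof hold, which is the contradiction used above.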

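The contradiction shows that (12.4.4) indeed holds. Then, by (12.4.2), we have

$$\lambda_j \le 2\ln\frac{1}{\epsilon} \quad \text{for } j = 1, \ldots, n.$$

We claim now that

(12.4.7)   $$\lambda_1 \ge 3\ln\epsilon.$$

Indeed, if $\lambda_1 < 3\ln\epsilon$ then by (12.4.1)

$$\zeta_{\{1,j\}} = \frac{1}{1 + e^{\lambda_1 + \lambda_j}} > \frac{1}{1 + \epsilon} > 1 - \epsilon \quad \text{for } j = 2, \ldots, n$$

and

$$\beta(n-1) > d_1 = \sum_{j:\ j \ne 1} \zeta_{\{1,j\}} > (n-1)(1-\epsilon),$$

which contradicts (12.4.3).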

Summarizing, from (12.4.4) and (12.4.7), we obtain

$$\frac{\epsilon^4}{1 + \epsilon^4} \le \frac{1}{1 + e^{2\lambda_n}} \le \zeta_{\{j,k\}} \le \frac{1}{1 + e^{2\lambda_1}} \le \frac{1}{1 + \epsilon^6} \quad \text{for all } j \ne k,$$

which completes the proof. □